Figure 1 



CGAGAGACGACAGAAGGTTACGGCTGCGAGAAGACGACAGAAGGGTCCAGAAAAA 

GGAAAGTGCTGGAGGGGAGTGGGGACAAAAGCAGCGACCAAGTGAATGTCACTTC 

AGTGACTGAGGCCAGGCAAAACGCGCGGGAAGGATTTTGTGTAGCTTGGGACCCTT 

TCATAGACACTGATGACACGTTTACGCAAAATAGAAATTTGAGGAGAAACGCCTGG 

GCCTTCGGAAAGGAGTGATTGATTAGTACTTGCAAGTTTAGGTGACTTTAAGGAGAA 

CTAACTAATGTATACTATTGAGGGAGGAGGAAGAGCATTACAGAGTTTCCAGCAGC 

AGCAGGAAAGCTTTGGTTAATTTGGAAATGGATGATAGCATTAAAATAACAGAAGC 

GCCTCCAGGTCTCTGAAGCTTCAGTCCCCCAGCTGAAAGCCAGAAAAGACTAAGCC 

CACTAAGCCTTTTGATCCCTTTGGAAGCAAAGAACTTTCCTTCCCTGGGGTGAAGAC 

TCTCCfCAGAAGATTTCCTGTCTCTGCCTATGTTACAAGAGGAATCAAAACCAAGAC 

AGAAGAGCTCAGGATGCAGGTGAGAGGCAGGGAAGTCAGCGGCTTGTTGATCTCCC 

TCACTGCTGTCTGCCTGGTGGTCACCCCTGGGAGCAGGGCCTGTCCTCGCCGCTGTG 

CCTGCTATGTGCCCACAGAGGTGCACTGTACATTTCGGTACCTGACCTCCATCCCAG 

ATGGCATCCCGGCCAATGTGGAACGAATAAATTTAGGATATAACAGCCTTACTAGAT 

TGACAGAAAACGACnTGATGGCCTGAGCAAACTGGAGTTACTCATGCTGCACAGT 

AATGGCATTCACAGAGTCAGTGACAAGACCTTCTCGGGCTTGCAGTCCTTGCAGGTC 

TTAAAAATGAGCTATAACAAAGTCCAAATCATTCGGAAGGATACTTTCTACGGACTC 

GGGAGCTTGGTCCGGTTGCACCTGGATCACAACAACATTGAATTCATCAACCCTGAG 

GCCTTTTATGGACTTACCTCGCTCCGCTTGGTACATTTAGAAGGAAACCGGCTCACA 

AAGCTCCATCCAGACACATTTGTCTCATTAAGCTATCTCCAGATATTTAAAACCTCTT 

TCATTAAGTACCTGTTCTTGTCTGATAAOTCCTGACCTCCCTCCCAAAAGAAATGGT 

CTCCTACATGCCAAACCTAGAAAGCCTGTATTTGCATGGAAACCCATGGACCTGTGA 

CTGCCATTTAAAGTGGTTGTCTGAGTGGATGCAGGGAAACCCAGATATAATAAAAT 

GCAAGAAAGACAGAAGCTCTTCCAGTCCTCAGCAATGTCCCCTTTGCATGAACCCCA 

GGATCTCTAAAGGCAGACCCTTTGCTATGGTACCATCTGGAGCTTTCCTATGTACAA 

AGCCAACCATTGATCCATCACTGAAGTCAAAGAGCCTGGTTACTCAGGAGGACAAT 

GGATCTGCCTCCACCTCACCTCAAGATTTCATAGAACCCTTTGGCTCCTTGTCTTTGA 

ACATGACANANNTNTCTGGAAATAAGGCCGACATGGTCTGTAGTATCCAAAAGCCA 

TCAAGGACATCACCAACTGCATTCACTGAAGAAAATGACTACATCATGCTAAATGC 

GTCATTTTCCACAAATCTTGTGTGCAGTGTAGATTATAATCACATCCAGCCAGTGTG 

GCAACTTCTGGCITrATACAGTGACTCTCCTCTGATACTAGAAAGGAAGCCCCAGCT 

TACCGAGACTCCTTCACTGTCTrCTAGATATAAACAGGTGGCTCTTAGGCCTGAAGA 

CATTTTTACCAGCATAGAGGCTGATGTCAGAGCAGACCCTTTTTGGTTCCAACAAGA 

AAAAATTGTCTTGCAGCTGAACAGAACTGCCACCACACTTAGCACATTACAGATCCA 

GTTTTCCACTGATGCTCAAATCGCTTTACCAAGGGCGGAGATGAGAGCGGAGAGAC 

TCAAATGGACCATGATCCTGATGATGAACAATCCCAAACTGGAACGCACTGTCCTGG 

TTGGCGGCACTATTGCCCTGAGCTGTCCAGGCAAAGGCGACCCTrCACCTCACTTGG 

AATGGCTTCTAGCTGATGGGAGTAAAGTGAGAGCCCCTTACGTTAGCGAGGATGGG 



CGAATCCTAATAGACAAAAATGGGAAGTTGGAACTGCAGATGGCTGACAGCTTTGA 

TGCAGGTCTTTACCACTGCATAAGCACCAATGATGCAGATGCGGATGTTCTCACATA 

CAGGATAACTGTGGTAGAGCCCTATGGAGAAAGCACACATGACAGTGGAGTCCAGC 

ACACAGTGGTTACGGGTGAGACGCTCGACCTTCCATGCCTTTCCACGGGTGTTCCAG 

ATGCTTCTATTAGCTGGATTCTTCCAGGGAACACTGTGTTCTCTCAGCCATCAAGAG 

ACAGGCAAATTCTTAACAATGGGACCTTAAGAATATTACAGGTTACGCCAAAAGAT 

CAAGGTCATTACCAATGTGTGGCTGCCAACCCATCAGGGGCCGACTTTTCCAGTTTT 

AAAGTTTCAGTTCAAAAGAAAGGCCAAAGGATGGTTGAGCATGACAGGGAGGCAG 

GTGGATCTGGACTTGGAGAACCCAACTCCAGTGTTTCCCTTAAGCAGCCAGCATCTT 

TGAAACTCTCTGCATCAGCTTTGACAGGGTCAGAGGCTGGAAAACAAGTCTCCGGTG 

TACATAGGAAGAACAAACATAGAGACTTAATACATCGGCGGCGTGGGGATTCCACG 

CTCCGGCGATTCAGGGAGCATAGGAGGCAGCTCCCTCTCTCTGCTCGGAGAATTGAC 

CCGCAACGCTGGGCAGCACTTCTAGAAAAAGCCAAAAAGAATTCTGTGCCAAAAAA 

GCAAGAAAATACCACAGTAAAGCCAGTGCCACTGGCTGTTCCCCTCGTGGAACTCA 

CTGACGAGGAAAAGGATGCCTCTGGCATGATTCCTCCAGATGAAGAATTCATGGTTC 

TGAAAACTAAGGCTTCTGGTGTCCCAGGAAGGTCACCAACTGCTGACTCTGGACCAG 

TAAATCATGGTTTTATGACGAGTATAGCITCTGGCACAGAAGTCTCAACTGTGAATC 

CACAAACACTACAATCTGAGCACCTTCCTGATTTCAAATTATTTAGTGTAACAAACG 

GTACAGCTGTGACAAAGAGTATGAACCCATCCATAGCAAGCAAAATAGAAGATACA 

ACCAACCAAAACCCAATCATTATCITrCCATCAGTAGCTGAAATTCGAGATTCTGCT 

CAGGCAGGAAGAGCATCTTCCCAAAGTGCACACCCTGTAACAGGGGGAAACATGGC 

TACCTATGGCCATACCAACACATATAGTAGCTTTACCAGCAAAGCCAGTACAGTCTT 

GCAGCCAATAAATCCAACAGAAAGTTATGGACCTCAGATACCTATTACAGGAGTCA 

GCAGACCTAGCAGTAGTGACATCTCTTCTCACACTACTGCAGACCCTAGCTTCTCCA 

GTCACCCTTCAGGTTCACACACCACTGCCTCGTCTTTATTTCACATTCCTAGAAACAA 

CAATACAGGTAACTTCCCCTTGTCCAGGCACTTGGGAAGAGAGAGGACAATTTGGA 

GCAGAGGGAGAGTTAAAAACCCACATAGAACCCCAGTTCTCCGACGGCATAGACAC 

AGGACTGTGAGGCCAGCAATCAAGGGACCTGCTAACAAAAATGTGAGCCAAGTTCC 

AGCCACAGAGTACCCTGGGATGTGCCACACATGTCCTTCCGCAGAGGGGCTCACAG 

TGGCTACTGCAGCACTGTCAGTTCCAAGTTCATCCCACAGTGCCCTCCCCAAAACTA 

ATAATGTTGGGGTCATAGCAGAAGAGTCTACCACTGTGGTCAAGAAACCACTGTTAC 

TATTTAAGGACAAACAAAATGTAGATATTGAGATAATAACAACCACTACAAAATAT 

TCCGGAGGGGAAAGTAACCACGTGATTCCTACGGAAGCAAGCATGACTTCTGCTCC 

AACATCTGTATCCCTGGGGAAATCTCCTGTAGACAATAGTGGTCACCTGAGCATGCC 

TGGGACCATCCAAACTGGGAAAGATTCAGTGGAAACAACACCACTTCCCAGCCCCC 

TCAGCACACCCTCAATACCAACAAGCACAAAATTCTCAAAGAGGAAAACTCCCTTG 

CACCAGATCTTTGTAAATAACCAGAAGAAGGAGGGGATGTTAAAGAATCCATATCA 

ATTCGGTTTACAAAAGAACCCAGCCGCAAAGCTTCCCAAAATAGCTCCTCTTTTACC 

CACAGGTCAGAGTTCCCCCTCAGATTCTACAACTCTCTTGACAAGTCCGCCACCAGC 

TCTGTCTACAACAATGGCTGCCACTCAGAACAAGGGCACTGAAGTAGTATCAGGTG 



CCAGAAGTCTCTCAGCAGGGAAGAAGCAGCCCTTCACCAACTCCTCTCCAGTGCTTC 

CTAGCACCATAAGCAAGAGATCTAATACATTAAACTTCTTGTCAACGGAAACCCCCA 

CAGTGACAAGTCCTACTGCTACTGCATCTGTCATTATGTCTGAAACCCAACGAACAA 

GATCCAAAGAAGCAAAAGACCAAATAAAGGGGCCTCGGAAGAACAGAAACAACGC 

AAACACCACCCCCAGGCAGGTTTCTGGCTATAGTGCATACTCAGCTCTAACAACAGC 

TGATACCCCCTTGGCTTTCAGTCATTCCCCACGACAAGATGATGGTGGAAATGTAAG 

TGCAGTTGCTTATCACTCAACAACCTCTOTCTGGCCATAACTGAACTGTTTGAGAA 

GTACACCCAGACTTTGGGAAATACAACAGCTTTGGAAACAACGTTGTTGAGCAAAT 

CACAGGAGAGTACCACAGTGAAAAGAGCCTCAGACACACCACCACCACTCCTCAGC 

AGTGGGGCGCCCCCAGTGCCCACTCCTTCCCCACCTCCTTTTACTAAGGGTGTGGTT 

ACAGACAGCAAAGTCACATCAGCTTTCCAGATGACGTCAAATAGAGTGGTCACCAT 

ATATGAATCTTCAAGGCACAATACAGATCTGCAGCAACCCTCAGCAGAGGCTAGCC 

CCAATCCTGAGATCATAACTGGAACCACTGACTCTCCCTCTAATCTGTTTCCATCCAC 

TTCTGTGCCAGCACTAAGGGTAGATAAACCACAGAATTCTAAATGGAAGCCCTCTCC 

CTGGCCAGAACACAAATATCAGCTCAAGTCATACTCCGAAACCATTGAGAAGGGCA 

AAAGGCCAGCAGTAAGCATGTCCCCCCACCTCAGCCTTCCAGAGGCCAGCACTCAT 

GCCTCACACTGGAATACACAGAAGCATGCAGAAAAGAGTGTTTTTGATAAGAAACC 

TGGTCAAAACCCAACTTCCAAACATCTGCCTTACGTCTCTCTACCTAAGACTCTATTG 

AAAAAGCCAAGAATAATTGGAGGAAAGGCTGCAAGCTTTACAGTTCCAGCTAATTC 

AGACGTTTTTCTTCCTTGTGAGGCTGTTGGAGACCCACTGCCCATCATCCACTGGACC 

AGAGTTTCATCAGGANTTGAAATATCCCAAGGGACACAGAAAAGCCGGTTCCACGT 

GCTTCCCAATGGCACCTTGTCCATCCAGAGGGTCAGTATTCAGGACCGTGGACAGTA 

CCTGTGCrCTGCATTTAATCCACTGGGCGTAGACCATTTTCATGTCTCTTTGTCTGTG 

GTTTTITACCCGGCAAGGATTTTGGACAGACATGTCAAGGAGATCACAGTTCACm 

GGAAGTACTGTGGAACTAAAGTGCAGAGTGGAGGGTATGCCGAGGCCTACGGTTTC 

CTGGATACTTGCAAACCAAACGGTGGTCTCAGAAACGGCCAAGGGAAGCAGAAAGG 

TCTGGGTAACACCTGATGGAACATTGATCATCTATAATCTGAGTCTTTATGATCGTG 

GTTTTTACAAGTGTGTGGCCAGCAACCCATCTGGCCAGGATTCACTGTTGGTTAAGA 

TACAAGTCATCACAGCTCCCCCTGTCATTATAGAGCAAAAGAGGCAAGCCATCGTTG 

GGGTTTTAGGTGGAAGTTTGAAACTGCCCTGCACTGCAAAAGGAACTCCCCAGCCTA 

GTGTTCACTGGGTCCTTTATGATGGGACTGAACTAAAACCATTGCAGTTGACTCATT 

CCAGATTTTTCTTGTATCCAAATGGAACTCTC^^ 

GAGGGGCACTTATGAGTGCATTGCCACCAGCTCCTCAGGCTCAGAGAGAAGGGTAG 

TGATTCTTACTGTGGAAGAGGGAGAGACAATCCCCAGGATAGAAACTGCCTCTCAG 

AAATGGACTGAGGTGAATTTGGGTGAGAAATTACTACTGAACTGCTCAGCTACTGG 

GGATCCAAAGCCTAGAATAATCTGGAGGCTGCCATCCAAGGCTGTCATCGACCAGT 

GGCACAGAATGGGCAGCCGAATCCACGTCTACCCAAATGGATCCTTGGTGGTTGGG 

TCAGTGACGGAAAAAGACGCTGGTGACTACTTATGTGTGGCAAGAAACAAAATGGG 

AGATGACCTAGTCCTGATGCATGTCCGCCTGAGATrGACACCTGCCAAAATTGAACA 

GAAGCAGTATTTTAAGAAGCAAGTGCTCCATGGGAAAGATTTCCAAGTTGACTGCA 



AGGCCTCTGGCTCCCCTGTGCCTGAGGTATCCTGGAGTTTGCCTGATGGGACAGTGC 

TCAACAATGTAGCCCAAGCTGATGACAGTGGCTATAGGACCAAGAGGTACACCCTT 

TTCCACAATGGAACCTTGTATTTCAACAACGTTGGGATGGCAGAGGAAGGAGATTAT 

ATCTGCTCTGCCCAGAACACCTTAGGGAAAGATGAGATGAAAGTCCACCTAACAGT 

TCTAACAGCCATCCCACGGATAAGGCAAAGCTACAAGACCACCATGAGGCTCAGGG 

CTGGAGAAACAGCTGTCCTTGACTGCGAGGTCACTGGGGAACCGAAGCCCAATGTA 

TTTTGGTTGCTGCCTTCCAACAATGTCATTTCATTCTCCAATGACAGGTTCACATTTC 

ATGCCAATAGAACTTTGTCCATCCATAAAGTGAAACCACTTGACTCTGGGGACTATG 

TGTGCGTAGCTCAGAATCCTAGTGGGGATGACACTAAGACATACAAACTGGACATT 

GTCTCTAAACCTCCATTAATCAATGGCCTGTATGCAAACAAGACTGTTATTAAAGCC 

ACAGCCATTCGGCACTCCAAAAAATACTTTGACTGCAGAGCAGATGGGATCCCATCT 

TCCCAGGTCACGTGGATTATGCCAGGCAATATTTTCCTCCCAGCTCCATACTTTGGA 

AGGAGAGTCACGGTCCATCCAAATGGAACCTTGGAGATGAGGAACATCCGGCTTTC 

TGACTCTGCGGACTTCACCTGTGTGGTTCGGAGCGAGGGAGGAGAGAGTGTGTTGGT 

AGTGCAGTTAGAAGTCCTAGAAATGCTGAGAAGACCAACATTCAGAAACCCATTCA 

ACGAAAAAGTCATCGCCCAAGCTGGCAAGCCCGTAGCACTGAACTGCTCTGTGGAT 

GGGAACCCCCCACCTGAAATTACCTGGATCTTACCTGACGGCACACAGTTTGCTAAC 

AGACCACACAATTCCCCGTATCTGATGGCAGGCAATGGCTCTCTCATCCTTTACAAA 

GCAACTCGGAACAAGTCAGGGAAGTATCGCTGTGCAGCCAGGAATAAGGTTGGCTA 

CATCGAGAAACTCATCCTGTTAGAGATTGGGCAGAAGCCAGTCATTCTGACATACGA 

ACCAGGGATGGTGAAGAGCGTCAGTGGGGAACCGTTATCACTGCATTGTGTGTCTG 

ATGGGATCCCCAAGCCAAATGTCAAGTGGACTACACCGGGTGGCCATGTAATCGAC 

AGGCCTCAAGTGGATGGAAAATACATACTGCATGAAAATGGCACGCTGGTCATCAA 

AGCAACAACAGCTCACGACCAAGGAAATTATATCTGTAGGGCTCAAAACAGTGTTG 

GCCAGGCAGTTATTAGCGTGTCAGTGATGGTTGTGGCCTACCCTCCCCGAATCATAA 

ACTACCTACCCAGGAACATGCTCAGGAGGACAGGGGAAGCCATGCAGCTCCACTGT 

GTGGCCTTGGGAATCCCCAAGCCAAAAGTCACCTGGGAGACGCCAAGACACTCCCT 

GCTCTCAAAAGCAACAGCAAGAAAACCCCATAGAAGTGAGATGCTTCACCCACAAG 

GTACGCTGGTCATTCAGAATCTCCAAACCTCGGATTCCGGAGTCTATAAGTGCAGAG 

CTCAGAACCTACTTGGGACTGATTACGCAACAACTTACATCCAGGTACTCTGACAGG 

AAGGGGGAGACTAAAATTCAACAGAAGTCCACATCCACAGGGTTTATTTTTTGGAA 

GAAGTTTAATCAAAGGCAGCCATAGGCATGTAAATGAGTCTGAATACATTTACAGT 

ATTAAATTTACAATGGACATGCGATGAGACTTGTAAATGAAAGCATTGTGAACTGA 

AACCGAGTCTCTGTGGATCTCAAAGCAAACTOTAACTTAAGGCA(mTGATITrGC 

CAACAAATAATAACAAACATTAAGAGAAAAAAATGATCCACTACGAAATAACAAAC 

GGCTAATGCACCTGAATTCTCAGTAAAAAGACCTTTCTCTCGCTAACAGTTGCCAGC 

TGCCTCGTGTCTGTTTCCTACCAATGTCACAAACATCGCACACAGGGTGAATGGAGT 

CAACGGGAAAGATTAAGTTTGCGGTCTGTGTAAATCTCAATGTACAAATATTCTGTC 

NCTGGTTTATAAACATTTTGATAAAACCGAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAA (SEQ ID NO: 1) 
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MQWGREVSGmSLTAVCLVWPGSRACPRRCACYVPTEVHCTFRYLTSIPDGIPANVE 

RD^GYNSLTE^TENDFDGl^KIJELLMLHSNGimVSDKTFSGLQSLQVLKMSYNKVQn 

RKDITYGLGSLVRLHmHNNIEFINPEAFYGLTSLRLVHLEGNRLTKLHPDTFVSLSYLQI 

FKTSFIKYLFI^DNFLTSLPKEMVSYMPNLESLYLHG^^ 

KCKJQDRSSSSPQQCPLCM^ 

ASTSPQDFffiPFGSI^L^MXXSGNKADMVCSIQKPSRTSF^AFTEENDYIMLNASFSTNL 

VCSVDYNfflQPWQlXALYSDSPLlI^RKPQLTETPSLSSRYKQVALRPEDIFTSffiADVR 

ADPFWFQQEKWLQLNRTATTLSTLQIQFSTDAQIALPRAEMRAER^ 

LERTVLVGGTIAI^CPGKGDPSPHIJEW1I.ADGSKVRAPYVSEDGRILIDKNGKLELQMA 

DSFDAGLYHCISTNDADADVLTYRITWEPYGESTHDSGVQHTWTGETLDLPCLSTGV 

PDASISWILPGNTWSQPSRDRQIlJWGTIi^QVTPKDQGHYQCVA 

SVQKKGQPJvlVEHDP^AGGSGLGEPNSSVSLKQPASLKLSASALTGSEAGKQVSGVHRK 

NEmDLIHRRRGDSTLRRFREHRRQIPI^ARRIDPQRWAAIXEKAKKNSVPKKQENTT 

KPVPLAWLVELTDEEKDASGM1PPDEEFMVLKTKASGVPGRSPTADSGPVNHGFMTSI 

ASGTEVSTVNPQTLQSEHIJPDFKIJSVTNGTAVTKSMNPSIASKIEDTTNQNPIIIFPSVAE 

IRDSAQAGRASSQSAHPWGGNMATYGHTOTYSSFTSKASTVLQPINPTESYGPQIP^ 

SRPSSSDISSHTTADPSFSSHPSGSHTTASSIJTHmRNNNTGNFPl^RHLGRERTIWSR 

KNPmTPVIJaiHRmTVRPAIKGPANKNVSQWATEYPGMCOT 

WSSSHSALPKTNNVGVIAEESTTVVKIO^IiLFKDKQNVDffinTTTTKYSGGESNHWTE 

ASMTSAPTSVSmKSPVDNSGHI^MPGTIQTGKDSVETTPLPSPI^TPSIPTSTKFSKRKTP 

mQIFVNNQKKEGMIJKNPYQFGLQK^ 

TMAATQNKGTEVVSGARSI^AGKKQPFTNSSPVI^STISKRSNTLNFLSTETPTVTSPTAT 

ASVMSETQRTRSKEAKDQKGPRKNRNNANTTPRQVSGYSAYSALTTADTPLAFSHSP 

RQDDGGNVSAVAYHSTTSIXAITELFEKYTQTLGNTTALETTLLSKSQESTTVKRASDTP 

PP1XSSGAPPWTPSPPPFTKGWTDSKVTSAFQMTSNRWTIYESSRHNTDLQQPSAEAS 

PNPEIITGTTDSPSNLFPSTSWALRVDKPQNSKWKPSPWPEHKYQLKSYSETIEKGKRPA 

VSMSPHLSLPEASTEL\SHWNTQKHAEKSVFDKKPGQNPTSKHLPYVSI^ 

GKAASFTWANSD\^PCEAVGDPLP]IEIWTRVSSGXEISQGTQKSRFHVLPNGTLSIQRV 

SIQDRGQYLCSAFNPLGVDHFHVSI^VWYPARIlI)RHVKErrVHFGSTVELKCRVEGMP 

RPTVSWILANQTVVSETAKGSRKWVTPDGTLnYNI^LYDRGFYKCVASNPSGQDSLL 

VKIQVITAPPVIIEQKRQAIVGVLGGSLKLPCTAKGTPQPSVHWVLYDGTELKPLQLTHS 

RFFLYPNGTLYIRSIAPSVRGTYECIATSSSGSERRVVILTVEEGETIPRIETASQKWTEVN 

LGEKLIiNCSATGDPia'RIIWRLPSKAVIDQWFimGSPJHVYPNGSLVVGSVTEKDAGD 

YLCVARNmGDDLVLMHVRLRLTPAKIEQKQYFKKQVLHGKDFQVDCKASGSPVPEV 

SWSLPDGTVLNNVAQADDSGYRTKRYTIJ^GTLYFNNVGMAEEGDYICSAQNTLGK 

DEMKVHLTVLTAIPRIRQSYKTTMRLRAGETAVLDCEVTGEPKPNVFWLLPSNNVISFS 

NDRFTFHANRTI^IHKVKPLDSGDYVCVAQNPSGDDTKTYKIJ)IVSKPPLINGLYANKT 

VIKATAIRHS KKYFD CR ADGIPS S QVTWIMPGN1FLPAP YFGSRVTVHPNGTLEMRNIRLS 



DSADFTCVVRSEGGESVLWQIJEVLEMLRRPTFRNPFNEKVIAQAGKPVAI^CSVDGIS^ 
PPEITWILPDGTQFANRPHNSPYmAGNGSLlLYKATRNKSGKYRCAARNKVGYffiKL^ 
DBIGQKPVILTYEPGMVKSVSGEPI^LHCVSDGIPKPNWWTTPGGHVIDRPQVDGKYIL 
HENGTLVIKATTAHDQGNYICRAQNSVGQA^ 

AMQLHCVALGIPKPKVTWETPRHSLLSKATARKPHRSEMLHPQGTLVIQNLQTSDSGVY 
KCRAQNLLGTDYATTYIQVL (SEQ ID NO: 2) 
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TTIXXIAACCAACCXAGAIXXXXXTO 
TCC^TCGAArACTACIX^CmTTAAAA^ 
GNATGGTCTGNAQGATCTTCAGItUOOT 
ACTCACTGATAAGTGGCTATTTGT 

TTITroTGTOTATAATATTAAQATOGAAATTTG<X^GTGCTO 
TQTTlX^QTGGATrACCCK}AAGAAAATAATAAGTQAACTCTAG<lAA.QAA 



AOQTGAC^AAQTATCC3t^CAtMTT QflGAAA^^ 
<KSATTCATATTAACICCA0TTCC<X1ATGTGCAGCTCTO 
AACTTreACCAGCTTCCTACACAGAGCXOTTACrATCCTTt^ 
pCATTAAOCTXKlACroAAACATCAGTClTOACTT^ 
TOTAAAGTATOTGACCAAAGAGCATGOTCVr^ 
TAATTTTAT<XmGAAATC?^CAA6(^^ 
OATGGOCrTTCAQTTTTOAAAOTAATn^GTATGTOTT^ 
ATTTAAKXlCCX^VAGTAGGCATAATTTTAATGCnTAC^ 
TCTlCTAAAAAOACTTTtK}TATGCATAG(UTCTAA^ 
AAATArK}AQAAACroAGCrAOAATAGOTATAAAATATOTGCTG<3CITTCT 
ATAGGTTATATAAGAGOTOGGAMOC^ 
AGTA<KX3AT0CTOGGAAAATACTTACTCTAACACACI^^ 
GOA(XnX}A<XnraXXiAAO<K^^ 
GAogrrAGTCT<raAAGQAGcrrrTqo^ 
GATGCCATATTrCCAAO<3CAAGTGTCCAGATO<XX^ 

CA<flXK7rAAGGACCrrOTT^^ 

TTOACTCTTrAATTCTAOCTTGATCATTAATGACTAOA 

GATATGGAATATTACTAATTTGTACCCAAOCCAG^ 

ATTCTffrOOAGCa^GTCACGT^ 

ATTACCTGAOATGAGTCATTTTrcnX^A£AAGTCTTTT^ 

CTGATGAAGTAAQCATATAAAGTGAQAGCAOCATGA^ 

TOCTATTATAATGTGOAAATAAACTOACrTTAAAAAAA^ 

OAqTAAATCTrCCATAAATATCATCTO^TlTATAAAm 

AAAanrTTAOAAATmGTTAACV^CAAAGAttXXn^ 

TGGTQGGCTTOAGAOATrTGAOCTAAOAACrTAmxy^^ 

ATCKXXnCAAAACAITCTATaXAA^ 

ACACXK3ArTAAACAGTCATTrrAGTX^CArnCTC^^ 

CAGKjATOTATC^OQATAACATC^ 

AACAAC^CX?ACAACAAACCCATAAAAAITATCA0CKX1AAATOAACT 
QAAAAAGTATTATATQTICTtlA^^ 

ATAGACAAAGATCKnTTATOCAAGTGGGGATOGCQAAOGATACTT^ 

CAAAGCAACAACTAC^QAGTOAAQTAATCCAGAfMCTTAATGT^ 

TTTAATAATTCTATTTAAGATACAGAOCAA ACGAGTG^ 

ATAOAOAOAATAAAGTCATA QATACGiTimTlTAT^^ 

mACA(XTKAAACATAOATAACCAAC^AAOAXX3AAGAG^ 

<XjAAGAOTACATTCXIAAAGTX^GACACCATTCr^ 

GAAAOTAGQ^AAGGAGCTAAAG^ATCTOCAA(XCTATAGGTGCAACAACATTA 

AAO^OTACCXX^OGAGCrCTXiACTCTAGCT^TATA^ 

CCATCACTOGAAAGAGAOGmiATIX^ 

ATACCAG<XXX^AAAAGGGGGACrrOOOTGG^ 

GGACTTTKXyrATAGCATTGQAAATtnAAATOAG^ 

A Nae»17 

AAAAAAAAAAAAAAAAAAAAAGGAAGGTCAGACACCtCACTTCA 
<K^AAXXGAAGGO<^GTCACAAACCCAG 
TGTTATTGCIX?rTGTIGTrACCACCATX:ATTAGCATATAlirCA 
CnATGACATGTTTTITrATTCAAaTATATCACA 




CKX^CCAG<XXXXrrTTGCCCCGCCXX:CAACAC^C^ 

CACACACACA^CACACACACACACCTm^^ 

CACXXSAAQCTTAATCXTrTTOCnTtXSCTGCAATAGTA^ 

CCCCCnXXXCTGTTAG^GATOGGAOAGGT^ 

CKX^CKXXX3G<XX^CAGAGCCrXKXACCAOAC^ 

GGAflGAGCAAACKXK^qOCQAG<xrr^^ 

CCCATTOCCAQAAGCXZIGAGGCTATAqCTtX^j^^ 

OAGCGOCT(XTAATXXXXr[C<3G<3AAGTOAAOOGACCCA 

irExon 1 (Nbc 3661) 

GCTtXKK}T TCAGGQCQAGACAGAO<KnXXiQAT^^ 
ACTGGAGGCKlAXriTXXWACAAAAGCAXXXIi^ 

GGCAAAAGGAGCGGOAAGGATTATATGTAGCCTGGGACGCrTTCATAAAt^CTGATC 
ITGl<(K!AAAGCAAG<^AITTGAG^ 

AOTACTT<nAAGTTrAGGTGAGTITC^GAACrAACrAACCrAtACTA 
A<XATTCCAGCAG<^G<^GCAGCAG<^GCAATCAGATAAAGGAAAGCT^ 
ATGTATGATAOCATTAAAATAACAGA^ 
CW^C&CTAAGCCTACTAAGCCTmXX^^ 
TGAAGGCrCIXXTCAGAAGATTTCCTGTIXnxntXrrTATGTTA 
AGAAGAGCTCAGGTATTGCCAACTCT7T 



A Nw4203 

AO! 



TGTTAAATACAGTTTGAGGCITAAGTGTAKX^ 
CTTCTCriTATAACrAACTCITAA 
CATtTTTOTrrATCTAATAATAGOA^ 

f^A AAA ATATTPA mYiA tTJTA APA rVTVJ A A ryrrt^T A ^WATA rSA A rtTA ITV-AfTA/Tr* a att* 




CACGTGCOOflTTAAAGATATITrC^^ 
CCATCACTreTGTCAGTAA(£AGAOQ^ 

TCKXnxy^AAQTCAOAGACTrAOCa^ ^ 
AGTTG^CTXTTOAAAAGGaX^ATGTGTC^ 
GAAATTTQACCTTTnTrCTAgrG^m 

gCAAATAAATAAAACA(^TATTAAATAATAr^ATGTTACAAAAATTQATCATATAAAGAA 
OTACATTCATAAAKK^TtmiAAA^ 

TTrTCTAAATTOCACATOTACTATTTTTATATTTATAAAAATATTm 

AAC^TTATTATGATCTCTAATGAGAOGG ITCYZTGCTTTTCA TTGGG A AATOAOA AGGT A TO 

AAGATATGTTTAG^TAATAAGCCXlATrTAn TOATAAAGl^ 

ACTGGmATAAA^QAACJVGTTTC^ 

AOTTOCTOTraraGATT^ 

CATTATAA(yroqAATAAGATAOGATCTr.A AAGGAATGTGAATTTGGATGCCCTGAACA AT 

ACCrCTCAAACTAGAACTArcAATa^^ 
GAATTrATreClACATTGTTC^^ 

gQCTAATGAAAACAAOACAC^CT C AA^ 

*^ AA ^^ 




r^yy.Hi^vuowu imu i ivti A AlTAAAl\JAC.AOACA G< 

AA ^ < f^* AA TC^<X*^^^ 
^fr^^TTTOCAGTTCfT^^ 




CTTQAACTQCKXCTATAA^ 
TGTCXTO?a^TClXXXTGCTATC 

GGACXOTKXTOAJCI^^ 

AAACACCIXH^TTA^ 

TGOAGAC^TOAACATmATOX^TAGTCT^ 

GTTIXX3GAGCCTCATMmTG^ 

ACAarTTCTGTTAACAATCATrATGATATCAAGGA^^ 

AAAAGCATTTCTITroCAAm 

CCTfACTAAATCn^CAAATTATGGT^ 

ATAAATGCAC^GAAATQOaATATG<XATCACCT 

ATQCCAMCACTATTOTTCACGlTItWAG^ 

TATAACCOfrATACCCAGTG<^^ 

CTCm , ATGAG<mG<KKjCCTO 

GGCAGGlTrrATOATAACrim^ 

GAGAACATGCCK3AAGAA<UTCAGATC^^ 

TGATTCATG<K}GCAGGGCCCAGTCC^ 

AAAAAAAAAC^<KKTOAGCAAGTGTC^ 

TCTCTOCATCAGCincXJIX^ 

CAGTGATXSAACAATGATOIGGAACnATAAGCC^ 

TTGGTCATGATGTTTCATCACAGTGAT 

cerrrcmcTAAATixxixnTATCT 

CTACACCAGAGTGTAATCTGGG<X7roAGCI^ 
GGGGCTn^AACCGGAACACAGTTGTAAAAGK^ 

TGTGGACTITAATT(MCGATrCATCreM 

CTCACTAGATTGACAGAAAATCAC^ 

(X^T(XX&TTCACA<^^ 

GTGAG&TAGGTAGAGGGTGATGGAGGCIX^^ 

AAAAGCTOCTCATTCCCG^ 

TCAAG<UAGGATTTCMTTGTAATGCAAT^ 

CTGGGGAAAATGCTTACACAATtGCGAGGACCTAA 

AAAGCACAGC^TGGCGGCVGACACCTGCA^ 

CC^GAGACTGATTGGCCAAACACTCT^ 

ACTTOACrCAGAAACTAAT0TX3GAM^ 

C^TGCATGAATAAGGGATCCCAGAGAGAACKKJAAGAA^ 

AAdGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGAGAGGGAGGAA^ 

AAGGAAGGAAAGGGAAAGGAAAAMGAGATGGGGAGGGAGGGAAGG 

GGAGAAAGMGAGAAGAAACKLUAATAAATAAATTTr^ 

TTTTATCC^TAAAAGGTCATTTCX^ 

^GGGCAGTCmAACATAG 

TTTTTATXXnCTACX&TCT^ 

TGTAAGAGTITIX^CACAC^ 

MTGGGGAGAACAAAAA^GACTOCAC^ 

TraAGAAGCTAAITGA(^TGCATGA 

ATGAAAAGCACAATAAAGTACATATGlXXX^ 

ATAATAGCAATCAAAGCAATAA(^<XCACIX^^ 

ACTATGTCTAGKHTnt^CTT^^ 

ATTTAGCnLCATCAGACAG^TrTGGA^ 

G<^TOAAAT TClC(^ CCrATGm 

GGGTTrGAGGTTXTACIXXCAGGTAACATITAq 

CrTTTATXTATGAGGGATGTTOGTATKX^ 

GGTeAAAACAGAAAAATATGTAAAMGTTATTGTITTTATO 

XTrrAAAAATGOTATGOTAGAACTAATrAAG^ 

AGAGAGGGATITGATGb^ATGCTAAAGCATC^ 

CAGAATCAATTAAATTCATATTACTATAAAAGACAG^ 

GAGTGGATAAACTGTGTAACGTGAGTGCTATGTAG " 



TQTQCQCTGGAACATCTTQAAAACATr03aCTACA.TQATO 
AC&TATIXXIATQGITATGTITATATGAAATGTTT^ 
CTTAAGAIXiTrACTrAGGGTCAGGAAAAG 
ATCITAATrTGOCKJAAAAGACAATTTCCTMGACGAM 

GCAGGAGGAAGTGAAATGAGATGGTAGAAAGGAAAGTCATATACC^^ 

GGTGGAATCTAGATATGTTAATATATTGACATAAAGGAAG^ 

AAAACCAACAGGAGTGAGGGAGAC^TAGGAACCAATGAGAG^ 

tgtgtctggagacacx^taataaaactcct: riinon i gctaactaaaaccactaaaatc 

TAAAAACAAAACATTTTTGCACAAGAATTATTTATTATTCAAT 

GAAGTTOAAGTTCAlTGATAGTCrcATAAATCTrAAATCTAm 

mATTATrAAmCTCTTGCTtSTC^^ 

ATGCITITCACCATA<^CAAATGTAG<3<^^ 

GGAOAGGGAMGCAAGCGGATGCCK^^ 

ATGAACAAGAAAAAATGTXKLVCAQVIXXIACAA^ 

TATTCTAATATTAAGAAAATAATAAACCCAT^^ 

TTATCITGGCCACAGAircrGTUrc^^ 

AATCCAATCCAGGGCATGATGCTTAClXXnACA(^GAACTAAAGATTA^ 
AAGTAAOCATGACATCrrCrcrGlTOC^ 
GTATGTlCTAATTTCaX^AATATTGTt^ 
TCK3AGTTAAG<jlxXCATA<MATTAATCIXi\ACATATTTCT 

TTGTGAAAGTTCCTTTGAGAAAACAAATATG^ 

A<CTATAACAAA<m£AAATAATTCAGA^ 

TTCK&<Xm>ATCACAACAACATIK3A^^ 

TaX3CTTCX5TACATCIAGAAGX?AAAC^ 

GAGCTATCrCOW^AmAAAAanXXTTCAlT^ 

ACCTCXXICCJCMMfiAAATCKacC^^ 

ACCXOTGGACCTTFGACT^^ 

Nw 11680 f 

ACTATCTTGTTTGTTrGTTTCTTTTTn^^ 

TATCCCAAAAGTCCCCX^TAAO^FC^^ 

CCCTGGCATTXXXXSOTACTGGGGCAmTAAAGT^^ 

AGTGAltKK5CAACTAGOCCATCTTTTGATA 

TACTGGTTAGTTCATAAlX?ITGITG<^CCTACAGGGTrGAA 

(SEQIDNO:3) 
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No. 


start 


end 


length 


length 


1 


1 


542 


542 


2356 


2 


2898 


3094 


197 


1334 


3 


4428 


4557 


130 


3068 


4 


7625 


8019 


394 
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m 



f-t;.:a; 





2 


gdaosti-c^JKu 


1 


cDtffcjrat 


51 


g*nomiojbca 


1 




101 


g«nom±cJfcu 


1 


cJSaa^rat 


151 


cjeAOJ&io lit! 


1 




201 


gsnofcic_bu 


1 




251 








301 




i 


cJD&Ajrafc 


351 








401 




1 


cDK&rat 


451 


^QXlOmXC fill 


i 




501 


gonomic cu 


« 

i 




551 




X 




601 


gonomic Hu 


1 




651 


genomic hu 


1 




701 




1 




751 


gonomlcjm 


1 




801. 


sre»omic_Jni 


1 


cW\at 


851 




1 




V01 




1 



cgagagacgacagaaggttacggctgcgagaagacgacagaagggtccag 



cttgcaagtttaggtgactttaaggagaactaactaatgtatactattga 
gggaggaggaagagcattacagagtttccagcagcagcaggaaagctttg 
gtfcaatttggaaatggatgatagcafctaaaataacagaagcgcetceagg 
totctgaagcttoagtcccccagctgaaagccagaaaagactaagGccaa 

taagacttttgatccctttggaagcaaagaactttccttccctggggtga 

agactctcctcagaagatttcctgtctct^ctatgttacaagaggaatc 

aaaaceaagacagaagagctcaggatgcaggtgagaggcagggaagtcag 

oggottgttgatQtocctoactgotgtctgo<?tggtggtcaooootggga 

gcagggootgtcctogccgctgtgocstgotatgtgcccacagaggtgcac 

tgtacatttcggtacctgacctccatccoagatggcatcccggocaatgt 

ggaacgaataaatttaggatataacagccttactagattgacagaaaacg 

actttgatggcctgagcaaactggagttactcatgctgcaoagtaatggc 

attcacagagtcagtgacaagaccttctcgggcttgcagtccttgcaggt 

cttaaaaatgagctataacaaagtcoaaatoattoggaaggatactttct 



cDN&^rat $51 



acggactcgggagcttggtccggttgcacctggatcacaacaacattgaa 



g©nomio_hTi 

gonofti cjbu 
genomic^ 

getvomlc^Jitt 

cOHArat 

cDK&_rat 

d?NA_rat 
gsnonxc 

g&nOttl.O hu 

oDHA_rat 
gonosii. c__hti 

cDNA_rat 
genome hu 



1001 
1 

1051 
1 

1101 
1 

1151 
1 

1201 
1 

1251 
1 

1301 
1 

1351 
1 

1401 
1 

1451 
l 

1501 
1 

1551 
1 

1601 
1 

16S1 
1 

1701 
1 

1751 
1 

1801 
1 

1*51 
1 

1$01 
1 

1951 
l 



ttcatcaaccctgaggccttttatggacttacctcgctccgcttggtaea 



tttagaaggaaaccggctcacaa&gctccatcc^igacacatttgtctcat 



taagotatctccagatatttaaaacctctttcattaagtacctgttcttg 



tctgataacttcctgacctccctcccaaaagaaatggtctcctacatgcc 



aaacctagaaagcctgtatttgcatggaaacceatggacctgtgactgcc 



atttaaagtggttgtctgagtggatgcagggaaacccagatataataaaa 



tgcaagaaagaoagaagctcttccagtcctcsagcaatgtcccctttgcat _ 



gaaccccaggatctctaaaggcagaccctttgctatggtacoatctggag 



ctttcctatgtacaaagccaaccattgatccatcactgaagtoaaagagc 



ctggttactcaggaggacaatggatctgcctccacuitcacttcaagattt 



catagaaoQCtttggctocttgtctttgaacatgacananntntctggaa 



ataaggccgacatggtctgtagtatocaaaagccatcaaggacatcacca 



a ctgca 1 1 cactgaagaaaat gact aca t ca t got aaa t gcgt catt t tc 



oacaaatcttgtgtgcagtgtagattataatcacatccagccagtgtggc 



aaottctggctttatacagtgactctcctctgatactagaaaggaageec 



=agcttaccgagactocttcactgtcttctagatataaaoaggtggotot 



taggcctgaagacatttttaccagcatagaggctgatgtcagagcagacc 



ctttttggttccaacaagaaaaaattgtcttgcagctgaacagaactgcc 



accacacttagoacattacagatooagttttcoactgatgctcaaatcgc 



tttaccaagggcggagatgagagcggagagactcaaatggacoatgatoo 







2001 
1 


tgatgatgaacaatcccaaactggaacgcactgtcctggttggcggcaet 






2051 
1 


attgccotgagctgtooaggoaaaggogacccttcacctcacttggaatg 




gcnomic_hu 


2101 
1 


gottctagotgatgggagtaaagttyagagccccttacgtt&gogaggatg 




oD*Bt_rafe 
g6&e&l&_lrc 


2151 
1 


ggcgaatcctaatagacaaaaatgggaagttggaactgcagatggctgac 




eDNA^rat 
gouoffiic llTl 


2201 
i 


agctttgatgc&ggtctttaccactgcataa$caccaatgatgcagatgc 






2251 

x 


ggatgttctcacatacaggataactgtggtagagccctatggagaaagca 






2301 
1 


cacatgacagtggagtTOagcacacagtggttacgggtgagacgctcgacs 


.Wr S 


genomic Jia 


2351 
1 


cttccatgcctttccacgggtgttccagatgcttotattagctggattct 




<3DNX_ra>t 


2401 
1 


tccagggaacactgfcgttctctcagocatcaagagacaggcaaattctta 


J . ! 


rat 

genomic Im 


2451 
1 


acaatgggaecttaagaafcattacaggttacgccaaaagatoaaggtcat 


fl 
J!! 


•SfTOoalcJtiu 


2501 
1 


taccaatgtgtggctgccaacooatoaggggccgacttttccegttttaa 






2551 


agttteagfctcaaaagaaaggccaaaggatggttgagcatgacagggagg 




g^JK>s&ic_Jbu 


2601 
1 


caggtggatctggacttggagaaccoaactccagtgtttcccttaagcag 




cPN&xat 
genomiqjhu 


2651 
1 


coagcatctttgaaactctctgoatcagctttgacagggtoagaggctgg 






2701 
1 


aaaaca&gtctccggtgtacataggaagaacaaacatagagacttaatac 






2751 
1 


atcggcggcgtggggattccacgctccggcgattcagggagcataggagg 






2801 
1 


cagctccctotctctgctcggagaattgacccgc?aacgctgggc&gcact 




ganomicjhu 


2851 
I 


tctagaaaaagceaaaaagaattctgtgccaaaaaagcaagaaaatacca 




cDHA_rat . 
genoaie^btt 


2901 
1 


cagtaaagccagtgccactggctgttoccctcgtggaaotcactgaogag 



cOK&jeat 2951 
ge*ao5ic ha l 



gaaaaggatgcctctggoatgattcctccagatgaagaattcatggttct 



gCQOA&C iXXl 

gaacSioJfcw 
cOtorat 

g ori o at q^hq 

ge&osiiojta 
<JD8\_xat 

g£&OILi.O lXQ. 
g^ flOfl t i C htt 

ge&cn^cjba 
ganoaicjra 

t 

ge&oaic^bu 

ganoaicjhu 
casta*** 

cDN\_rat 



300; 
305: 
310 
315 

320; 
32$: 

330 
335 
340 

345 

350 

355: 
360 
365 
370 
375: 
380 
385 
390 
395: 
400: 
405; 



410X 
1 



gaaaactoaggcttetggtgtcccaggaaggtcaccaactgctgactctg 



gaccagt^aatcatggttttatgacgagtatagcttctggcacagaagtc 



tc^actgtgaatocacaaacactacaatctgagcaccttootgatttcaa 



attatttagtgrtaacaaacggtacsagctgtgacaaagagtatgaacccat 



ccatagca&srcaaaatagaagatacaaccaaocaaaacccaatcattatc 



tttccatcagtagctgaaattogagattctgctcaggcaggaagagoato 



ttcccaaagtgcacacoctgtaacagggggaaacatggctaoctatggce 



atacoaaoacatatagtagotttaccagcaaagccagtacagtcttgcag 



ccaataaatccaacagaaagttatggacctcagatacctattacaggagt 



cagcagacotagcagtagtgacatctcttctcacactactgoagacccta 



gcttefccoagtoacccttcaggttcacss caeca ctgcctogtctttattt 



caoattcctagaaacaacaataoaggtaacttcccottgtccaggcactt 



gggaagagagaggacaa tt tggagcaga gggaga gttaaaaacccacata 



gaaccccagttctccgacggcatagaoaoaggacrtgtgaggccagcaatc 



aagggacctgctaacaaaaatgtgagccaagttccagccacagagtacco 



tgggatgtgccacacatgtccttccgcagaggggctcaoagtggctactg 



cagcactgtcagttocaagttcatcccaaagtgcoctcctcaaaactaat 



aatgttggggtcatagcagaagagtctaccactgtggtcaagaaacoact 



gttactatttaaggaoaaaoaaaatgtagatattg^gataataacaacca 



ctacaaaatattccggaggggaaagtaaccacgtgattoctacggaagca 



agcatgacttctgctccaacatctgtatccctggggaaafcctcctgtaga 



c&atagtggtcacctgagcatgcotgggaccatccaaaetgggaaagatt 



cagtggaaacaacaccacttcccagccccctcageacaecctcaatacca 



yj 



o 



Ms;:;? 

Pi 

|1J 



cDKA_rab 
gaz*oaic_fcu 

gsxxo&lG^litt 

cDSO^rat 
go&otitlc fcu. 

cDNAjrat 
g&zxosuLc &u 

cBHXrat 
g^xYQ&rf-C Hxi 

cDNAjrat 
gonosuLc fau 

g^UO&ic^Jltt 

genomic _Ku 
cB*»_rat 

ge&oaiejbtt 

genonicjtm 

c£N2i_rat 
gea^AiqJtiu 

goxvoaxc im 

gsncj&iLc &u 

gcnoaic^hu 

C©H\rat 
goacSicJro 

goncaie^hM 

genomic^u 



4151 
1 

4201 
1 

1 

4301 
1 

4351 
1 

"4401 
1 

4451 

1 

4501 

1. 

4551 

1 

4601 
1 

4051 

• 1 

4701 
1 

4751 
1 

4$01 

1 

4851 
1 

4901 
1 

4951 
1 

5001 
1 

5051 
1 

5101 
1 

5151 
1 

5201 
1 



acaagcacaaaattctcaaagaggaaaactccottgcsacoagatctttgfc 



aaataaccagaagaaggaggggatgttaaagaatccatatcaattcggtt 



tacaaaagaacccagccgcaaagottcccaaaatagctcctcttttacco 



acaggtoagagttccccctcagattctacaactctcttgacaagtccgoc 



accagctctgtctacaacaatggotgccactcagaacaagggcactgaag 



tagtatcaggtgccagaagtctctcagcagggaagaagcagcccttcaoo 



aaotcctotcoagtgettcctagcaecata&gcaagagatctaatacatt 



aaacttcttgtcaacggaaacccccacagtgacaagtcctactgctactg 



catctgtcattatgtotgaaacccaacgaacaagatccaaagaagoaaaa 



gaccaaataaaggggcctcggaagaacagaaacaacgcaaacaccacccc 



caggoaggtttctggctatagtgcatactcagctctaacaacagotgata 
coccottggctttcagtcattocccacgacaagatgatggtggaaatgta 

agtgoagttgcttatcactcaacaacctctcttctggccataactgaact 
gtttgagaagtacacccagactttgggaaatacaaoagctttggaaacaa 
cgttgttgagcaaatcacaggagagtaooaoagtgaaaagagcetcagao 



ac 



acoaQceccactcetcagcagtggggogcccccagtgocsoactccttc 
cccacctccttttactaagggtgtggttacagacagcaaagtcaoatcag 
ctttccagatgaogtcaaatagagtggtcaccatatatgaatcttcaagg 
cacaatacagatctgcagoaaocctcagcagaggctagccccaatootga 
gatcataactggaaccactgactctooctctaatctgtttccatccactt 
ctgtgccagcactaagggtagataaaccacagaattctaaatggaagcco 
tctccctggccagaacacaaatatcagctoaagtcatactccgaaaccat 



eDH\*at 
genoSde^hu 

eDHA_rat 

a&$t&_rat 
geaomic^Hu 

g<mcSic_h« 

genomie_hu 

cDHA_rat 
g^nosuLe^fctt 

genovil c^lxu 

gam>alo_htt 

oDK&_rat 

cBEl&jeat 
g<^oSiioJhu 

cDHft^rat 
genomic to 



5251 fcgagaagggc&aaaggccagcagtaagcatgtccccccacctcagccttc 

i 

5301 cagaggccagcactcatgcctcacactggaataeacagaagcatgcagaa 

i 

5351 aagagtgtttttgataagaaacctggtcaaaacccaacttecaaacatct 

i _ ^ — . 

5401 gccttacgtctctctacctaagactotattgaaaaagccaagaataattg 

1 

5451 gaggaaaggatgcaagctttacagttccagctaattcagacgtttttctt 

1 — 

5501 ccttgtgaggctgttggagacceactgcccatcatccactggaccagagfc ' 

1 

5551 ttcatcagganttgaaatatcccaagggacacagaaaagccggttccacg 

1 — * — 

5601 tgcttcocaatggcaccttgtccatccagagggtcagtattcaggaccgt 

! r „ 

5651 ggacagtacctgtgctctgcatttaatccactgggcgtagaccattttca 

x 

5701 tgtctctttgtctgtggttttttacccggcaaggattttggacagacatg 

1 

57ai rcaagga^au^aeagttoactttggaagtaotgtggaactftaagtgcaga 

1 

5801 gtggagggtatgccgaggcctacggtttcctggatacttgcaaaccaaac 

1 

5*51 ggtggtctcagaaacggccaagggaagcagaaaggtctgggtaacacctg 

1 ~ 

r 

5501 atgge&cattgatcatctataatctgagtctttatgatcgtggtttttac 

1 

5951 aagtgtgtggccagcaacccatctggccaggattcactgttggttaagat 

1 — 

s 

€001 acaagtcatcacagctccccctgtcattatagagcaaaagaggcaagcca 

1 

0051 tcgttggggttttaggtggaagtttga&aotgoGOtgcactcfcaaaagga 

I 

6101 actccccagcctagtgttoactgggtcctttatgatgggactgaactaaa 

X 

$151 accattgcagttgactcattocagatttttottgtatccaaatggaactc 

1 

6201 tgtatataagaagcatcgctecttcagtgaggggcacttafcg&gtgcatt 

1 

6251 gccaccagctcctcaggctcag&gagaagggtagtgattcttactgtgga 

I _ 



CttTGGAAGCAGAGTCACX^CC&TCCAAATGGAACC^ 
CtaTGGAAGCAGAATt^CAGTCCATAAAAATGGAAC^^ 

aeatccGGCTTTCT<^CTCTGCGCa<^TCaCCTGTCTGGttcggagcgag 
atgtgaGGCTTTCAGMTCAGCCi^CTOTATCTGTOTGGcccgaaatgaa 

ggaGGAGAGAGTGTGTTGGTAGTGC^TTAGAAGT^ 

g gtG<^GAGAGCGTGTTGGTAG!PACAGTTAGAAGTACT(^AAATGCTGAG 

AAGACCAACATTC&GAAACCCATT^ 
AA<^{XGA<^MAGAAATCCAmAATGAA^ 

gcaagcccgtAGCACTGAACTGCTCTGTG<^^ 

gaaagtccacAGCATTG^TTGCTCTGCTGATGGTAACCC^CX^COTC^ 

a 1 1 a cCTGGATCTTACCT GACGGCAC&CAGTTTGCTAACAGACCAca caa 
ataatCTGGATTTTACCAJ^TGGCAC^^ 

tt c cccGTAT CTGATGGCAGGCAATGGCTCTCTCAT CCTTTACAAAg caa 
ttatcaG^ATCTG^TAGCAAGCAATGGTTCTTTTATCATTTCTAAAacaa 

ct cggaaca agtCAGGGAAOTATCGCTCTGCAGCCAGGAAllAAGGTTGGC 
ct cgggaggatgCAGGAAAATATCGCTGTGCAGCTAGGAATAAAGTTGGC 

£Aeat:cgagaaactcatc<^gttagagATTGGGGAGAA(^ 
TAtattgagaaattagtoatattagaaATTGGCCAGAAOCOW?PTATTCT 

GACATACGAACCAGGGAtggtgaagagcgTCAGTGGGGAACCGTTATCAC 

TACeTMGC&OGRSGGJiw^ 

TGCATTGTGTGTCTGATGGgATCCCCAAGCC^^ 
TGCATTGTGT(?TCTGATGGaATCCCTAAGCCAAATATCAAArGGACTA^ 

CCGGGTGGccATGTAATCGAC&GGCCTCAAGTGGA^ 
CCAAGTGGttATGTAGTAGACAGGCCTCAAAT^AATGGGA 

GOAT GAAAAT GGCACGCT GGTCATCAAAGCAACAACAGC t ca cga ccaAG 
GCATGACMTGGCACC^TAGTCATTAAAG^GCAAC^ 

GAAATTATATCTGTAGGGCTCAAAAGAGTGTTGGCCAggcagttattago 
GAAACTATATCTGTMGGCTCAAAATAGTGTTGGTCAtacactgattact 

gt gt CAGTGATGGTTGTGGCC^ ACCCTTCCCCGAAT CATAAAc t a C CtACC 
gt:tcCAGTAATGATTGTAG<XTA(^C^CCCCGAATTACAAAtogt.ccACC 

CAGGAACATGCTCAGGAGGACAGGGGAAGCCatgCAGCTC^CTGTGTGG 
CAGGAGT AT T GTCACCAGCACAGGGGCAGCCt 1 1 CAGCT CCACTGTGTGG 

CCTTGGGAATCCCCAAGCOAaAAGTCACCTGGGAGACGCC^^GACTCC 
CCTTGGGAGTTCCC^GCCAgAAATC&CATG<^^ 

CXGCTCTCAAaagcaacagcaagaaaacccCATAGAAGTGAGAIGCTTCA 
CTTCTCTC»vAcggc5aagtaaagagaggac^CaTG(^GTGAGCAGCT!rCA 

CC£ACMGGTACGCTgGTCAT£CAGAA 

CTTACAAGG^ACCCTaGTCAT^CAGAATCCOCAAACCTCCGATTCTGGga 

tcTATAAGTGCAGAGCTCAGAACC^ACTTGGgacTGATTACGCAACAACT 
taTACAAATGCACAGCAAAGAACCCACTTGGtagTGATTATGCAGCAACG 

TAC&TCC&GGTACTCTG&C^^ 

mTATTC3AAGTAATCTGACA5GAAataataaagtcaacaacatctgggca 
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Exon/Intron 
No. 


Exon 
start 


Exon 
end 


Exon 
length 


Intron 
length 


Remarks 


1 


1 


208 


208 


69 


No valid splice site found upstream 
this exon 


2 


277 


429 


153 


18 




3 


447 


485 


39 


1561 




4 • 


. 2046 


2244 


199 


1351 




5 


3595 


3724 


130 


3254 




6 


6978 


7359 


382 


4123 




7 


11482 


14903 


3422 


38 




8 


14941 


15307 


367 


51 




9 


15358 


15825 


468 


1039 




10 


16864 


-17760 


-897 


? 


Last exon might be complete 


11 


?(D 


2317 


-231 


25 


Exon is not complete and start site is 






7 




not known 


12 


2342 


2397 


56 







Figure 12 

ATGAAGGTAAAAGGCAGAGGAATCACCTGCTTGCTGGTCTCCTTTGCTGTGATCTGCCTGGTCGCCACC 
CCTGGGGGCAAGGCCTGTCCTCGCCGCTGTGCCTGTTATATGCCTACGGAGGTACACTGCACATTTCGG 
TACCTGACTTCCATCCCAGACAGCATCCCGCCCAATGTGGAACGCATO^TTTAGGATACAACAGCTTG 
GTTAGATTGATGGAAACAGATTTTTCTGGCCTGACCAAACTGGAGTTACTC^TGCTTCACTVGCAATGGC 
ATTCACACAATCCCTGACAAGACCTTCTCAGATTTGCAGGCCTTGCAGGTCTTAAAAATGAGCTATAAT 
AAAGTCCGAAAACTTCAGAAAGATACTTTTTA^ 

AACAATATTGAGTTTATAAACCCAGAGGTTTTTTATGGGCTCAACTTTCTCCGCCTGGTGCACTTGGAA 
GGAAATCAGCTCACTAAGCTCCACCCAGATACATTTGTCTCTTTGAGCTACCTCCAGATATTTAAAATC 
TCTTTCATTAAGTTCCTATACTTGTCTGATAACTTCCTGACCTCCCTCCCTCAAGAGATGGTCTCCTAT 
ATGCCTGACCTAGACAGCCTTTACCTGCATGGAAACCCATGGACCTGTGATTGCCATTTAAAGTGGTTG 
TCTGACTGGATACAGCCAGATGTAATAAAATGCAAAAAAGATAGAAGTCCCTCTAGTGCTCAGCAGTGT 
CCACTTTGCATGAACCCTAGGACTTCTAAAGGCAAGCCGTTAGCTATGGTCTCAGCTGCAGCTTTCCAG 
TGTGCCAAGCCAACCATTGACTCATCCCTGAAAT 

GCTTTCATCTCTCCCCAAGGTTTCATGGCACCCTTTGGCTCCCTCACTTTGAATATGACAGATCAGTCT 
GGAAATGAAGCTAACATGGTCTGCAGTATTCAAAAGCCCTCAAGGACATCACCCATTGCATTCACTGAA 
GAAAATGACTACATCGTGCTAAATACTTCATTTTCAACATTTTTGGTGTGCAACATAGATTACGGTCAC 
ATTCAGCCAGTGTGGCAAATTTTGGCTTTGTACAGTGATTCTCCTCTGATACTAGAAAGGAGCCACTTG 
CTTAGTGAAACACCGCAGCTCTATTACAAATATAAACAGGTGGCTCCTAAGCCTGAAGACATTTTTACC 
AACATAGAGGCAGATCTCAGAGCAGATCCCTCTTGGTTAATGCAAGACCAAATTTCCTTGCAGCTGAAC 
AGAACTGCCACCACATTCAGTACATTACAGATCCAGT^ 

GCAGAGATGAGGCCAGTGAAACACAAATGGACTATGATTTCAAGGGATAACAATACTAAGCTGGAACAT 
ACTGTCTTGGTAGGTGGAACCGTTGGCCTGAACTGCCCAGGCCAAGGAGACCCCACCCCACACGTGGAT 
TGGCTTCTAGCTGATGGAAGTAAAGTGAGAGCCCCTTATGTCAGTGAGGATGGACGGATCCTAATAGAC 
AAAAGTGGAAAATTGGAACTCCAGATGGCTGATAGTTTTGACACAGGCGTATATCACTGTATAAGCAGC 
AATTATGATGATGCAGATATTCTCACCTATAGGATAACTGTGGTAGAACCTTTGGTCGAAGCCTATCAG 
GAAAATGGGATTCATC^C^CAGTTTTCATTGGTGAAACACTTGATCTTCCATGCCATTCTACTGGTATC 
CCAGATGCCTCTATTAGCTGGGTTATTCCAGGAAACAATGTGCTCTATCAGTCATCAAGAGACAAGAAA 
GTTCTAAACAATGGCACATTAAGAATATTACAGGTCACCCCGAAAGACCAAGGTTATTATCGCTGTGTG 
GCAGCCAACCCATCAGGGGTTGATTTTTTGATTTTCCAAGTTTCAGTCAAGATGAAAGGACAAAGGCCC 
TTGGAGCATGATGGAGAAACAGAGGGATCTGGACTTGATGAGTCCAATCCTATTGCTCATCTTAAGGAG 
CCACCAGGTGC&CAACTCCGTACATCTGCTCTGAT 

AGTAAGAGGCACAACTATCGGGAATTAACACTCCAGCGACGTGGAGATTCAACACATCGACGTTTTAGG 

GAGAATAGGAGGCATTTCCCTCCCTCTGCTAGGAGAATTGACCCACAACATTGGGCGGCACTGTTGGAG 

AAAGCTAAAAAGAATGCTATGCCAGACAAGCGAGAAAATACCACAGTGAGCCCACCCCCAGTGGTCACC- 

CAACTCCCAAACATACCTGGTGAAGAAGACGATTCCTC^GGCATGCTCGCTCTACATGAGGAATTTATG 

GTCCCGGCCACTAAAGCTTTGAACCTTCCAGCAAGGACAGTGACTGCTGACTCCAGAACAATATCTGAT 

AGTCCTATGACAAACATAAATTATGGCACAGAATTCTCTCCTGTTGTGAATTCACAAATACTACCACCT 

GAAGAACCCACAGATTTC^^CTGTCTACTGCTATTAAAACTACAGCC^TGTCAAAGAATATAAACC^ 

ACCATGTCAAGCCAAATACAAGGCACAACCAATCAACATTCATCCACTGTCTTTCCACTGCTACTTGGA 

GCAACTGAATTTCAGGACTCTGACCAGATGGGAAGAGGAAGAGAGCATTTCCAAAGTAGACCCCCAATA 

ACAGTAAGGACTATGATCAAAGATGTCAATGTCAAAATGCTTAGTAGCACCACCAACAAACTATTATTA 

GAGTCAGTAAATACCACAAATAGTCATCAGACATCTGTAAGAGAAGTGAGTGAACCCAGGCACAATCAC 

TTCTATTCTCACACTACTCAAATACTTAGCACCTCC^^ 

TCTCAGTTTCCGATCCCTAGAAATAGTACAGTTAACATCCCGCTGTTCAGACGCTTTGGGAGGCAGAGG 
AAAATTGGCGGAAGGGGGCGGATTATCAGCCCATATAGAACTCCAGTTCTGCGACGGCATAGATACAGC 
ATTTTCAGGTC^VACAACCAGAGGTTCTTCTGAAAAAAGCACTACTGCATTCTC^ 

GTGACATGTCTGTCCTGTCTTCCCAGGGAGAGGCTCACCACTGCCACAGC^GCATTGTCTTTTCCAAGT 
GCTGCTCCCATCACCTTCCCCAAAGCTGACATTGCTAGAGTCCCATCAGAAGAGTCTACAACTCTAGTC 
CAGAATCC^CTATTACTACTTGAGAACAAACCC^ 

AGGACTGAAATTTCCCAAGTGACTCCAACTGGTGCAGTCATGACATATGCTCCAACATCCATACCCATG 
GAAAAAACTCACAAAGTAAACGCCAGTTACCCACGTGTGTCTAGCACCAATGAAGCTAAAAGAGATTCA 
GTGATTACATCGTCACTTTCAGGTGCTATCACCAAGCCACCAATGACTATTATAGCCATTACAAGGTTT 
TCAAGAAGGAAAATTCCCTGGCAACAGAACTTTGTAAATAACCATAACCCAAAAGGCAGATTAAGGAAT 
CAACATAAAGTTAGTTTACAAAAAAGCACAGCTGTGATGCTTCCTAAAACATCTCCTGCTTTACCACAG 
AGACAAAGTTCCCCTTTCCATTTCACCACACTTTCAACAAGTGTGATGCAAATTCCATCTAATACCTTG 
ACTACCGCTCACCACACTACGACCAAAACACACAATCCTGGAAGTCTTCCAACAAAGAAGGAGCTTCCC 
TTCCCACCCCTTAACCCTATGCTTCCTAGTATTATAAGCAAAGACTCAAGTACAAAAAGCATCATATCA 
ACGCAAACAGCAATACCAGCAACAACTCCTACCT 

GAGAGATCTAGAGCACAAACAATACAAAGAGAACAGGAGCCTCAAAAGAAGAACAGGACTGACCCAAAC 
ATCTCTCCAGACCAGAGTTCTGGCTTCACTACACCCACTGCTATGACACCTCCTGCTCTGGCATTCACT 



CATTCCCCACCAGAAAACACAACTGGGATTTCAAGCACAATCAGTTTTCATTCAAGAACTCTTAATCTG 
ACAGATGTGATTGAAGAACTAGCCCAAGCAAGTACTCAGACTTTGAAGAGCACAATTGCTTCTGAAACA 
ACTTTGTCCAGCAAATCACACCAGAGTACCAC^ 

TTCTTGAGCAG(^GTGCTACTCTAATGCCAGTTCCC^TCTCCCCTCCCTTTACTCAGAGAGCAGTTACT 
GACACACGTGGCGACTCCCATTTCCGGCTTATGACAAATACAGTGGTCAAGCTGCACGAATCCTCAAGG 
CACAATCTCCAAATGCCAAGTTCACAATTGGAACCACTCACTTCATCTACCTCTAATCTGTTACATTCT 
ACTCCCATGCCAGCACTAACAACAGTTAAATCACAGAATTCCAAATTAACTCCATCTCCCTGGGCAG^ 
TACCAATTTTGGCACAAACCATACTCAGACATTGCTGAAAAAGGCAAAAAGCCAGAAGTAAGCATGTTG 
GCTACTACAGGCCTGTCCGAGGCCACCACTCTTGTTTCAGATTGGGATGGACAGAAGAACACAAAGAAG 
AGTGACTTTGATAAGAAACCAGTTCAAGAAGCAACAACTTCCAAACTCCTTCCCTTTGACTCTTTGTCT 
AGGTATATATTTGAAAAGCCC^GGATAGTTGGAGGAAAAGCTGCAAGTTTTACTATTCCAGCTAACTCA 
GATGCCTTTCTTCCCTGTGAAGCTGTTGGAAATCCCCTGCCCACCATTCATTGGACCAGAGTTTCAGGA 
CTTGATTTATCTAGAGGAAACCAGAATAGCAGGGTCCAGGTTCTCCCCAATGGTACCCTGTCCATCCAG 
AGGGTGGAAATTCAGGACCGCGGACAGTACTTGTGTTCCGCATCCAATCTGTTTGGCACAGACCACCTT 
CATGTCACCTTGTCTGTGGTTTCCTATCCTCCCAGGATCCTGGAGAGACGTACCAAAGAGATCACAGTT 
CATTCCGGAAGCACTGTGGAACTGAAGTGCAGAGCAGAAGGTAGGCCAAGCCCTACAGTTACCTGGATT 
CTTGCAAACCAAACAGTTGTCTCAGAATCATCCCAGGGAAGTAGGCAGGCTGTGGTGACGGTTGACGGA 
ACATTGGTCCTCCACAATCTCAGTATTTATGACCGTGGCTTTTACAAATGTGTGGCCAGCAACCCAGGT 
GGCCAGGATTCACTGCTGGTTAAAATACAAGTCATTGCAGCACCACCTGTTATTCTAGAGCAAAGGAGG 
CAAGTCATTGTAGGCACTTGGGGTGAAAGTTTAAAACTGCCCTGTACTGCAAAAGGAACTCCTCAGCCC 
AGCGTTTACTGGGTCCTCTCTGATGGCACTGAAGTGAAACCATTACAGTTTACCAATTCCAAGTTGTTC 
TTATTTTCAAATGGGACTTTGTATATAAGAAACCTAGCCTCTTCAGACAGGGGCACTTATGAATGCATT 
GCTACCAGTTCCACTGGTTCGGAGCGAAGAGTAGTAATGCTTACAATGGAAGAGCGAGTGACCAGCCCC 
AGGATAGAAGCTGCATCCCAGAAAAGGACTGAAGTGAATTTTGGGGACAAATTACTACTGAACTGCTCA 
GCCACTGGGGAGCCCAAACCCCAAATAATGTGGAGGTTACCATCCAAGGCTGTGGTCGACCAGTGGAGC 

TGGATCCACGTCTACCCTMTGGATCCCTGTTTATTGGATCAGTAACAGAAAAAGACAGTGGTGTCTAC 
TTGTGTGTGGCAAGAAACAAAATGGGGGATGATCTGATACTGATGCATGTTAGCCTAAGACTGAAACCT 
■GCCAAAATTGACCACAAGCAGTATTTTAGAAAGCAAGTGCTCCATGGGAAAGATTTCCAAGTAGATTGC 
AAAGCTTCCGGCTCCCCAGTGCCAGAGATATCTTGGAGTTTGCCTGATGGAACCATGATCAACAATGCA 
ATGCAAGCCGATGACAGTGGCCACAGGACTAGGAGATATACCCTTTTCAACAATGGAACTTTATACTTC 
. AACAAAGTTGGGGTAGCGGAGGAAGGAGAXTATACTTGCTATGCCCAGAACACCCTAGGGAAAGATGAA 
ATGAAGGTCCACTTAACAGTTATAACAGCTGCTCCCCGGATAAGGCAGAGTAACAAAACCAACAAGAGA 
ATCAAAGCTGGAGACACAGCTGTCCTTGACTGTGAGGTCACTGGGGATCCCAAACCAAAAATATTTTGG 
TTGCTGCCTTCCAATGACATGATTTCCTTCTCCATTGATAGGTACACATTTCATGCCAATGGGTCTTTG 
ACCATCAACAAAGTGAAACTGCTCGATTCTGGAGAGTACGTATGTGTAGCCCGAAATCCCAGTGGGGAT 
GACACCAAAATGTACAAACTGGATGTGGTCTCTAAACCTCCATTAATCAATGGTCTGTATACAAACAGA 
ACTGTTATTAAAGCC^CAGCTGTGAGACATTCCAAAAAACACTTTGACTGCAGAGCTGAAGGGACACCA 
TCTCCTGAAGTCATGTGGATCATGCCAGACAATATTTTCCTCACAGCCCCATACTATGGAAGCAGAATC 
ACAGTCCATAAAAATGGAACCTTGGAAATTAGGAATGTGAGGCTTTCAGATTCAGCCGACTTTATCTGT 
GTGGCCCGAAATGAAGGTGGAGAGAGCGTGTTGGTAGTACAGTTAGAAGTACTGGAAATGCTGAGAAGA 
CCGACATTTAGAAATCCATTTAATGAAAAAATAGTTGCCCAGCTGGGAAAGTCCACAGCATTGAATTGC 
TCTGTTGATGGTAACCCACCACCTGAAATAATCTGGATTTTACCAAATGGCACACGATTTTCCAATGGA 
CCACAAAGTTATCAGTATCTGATAGCAAGCAATGGTTCT^ 

GCAGGAAAATATCGCTGTGCAGCTAGGAATAAAGTTGGCTATATTGAGAAATTAGTCATATTAGAAATT 
GGCCAGAAGCCAGTTATXCTTACCTATGCACCAGGGACAGTAAAAGGCATCAGTGGAGAATCTCTATCA 
CTGCATTGTGTGTCTGATGGAATCCCTAAGCCAAATATCAAATGGACTATGCCAAGTGGTTATGTAGTA 
GACAGGCCTCAAATTAATGGGAAATACATATTGCATGACAATGGCACCTTAGTCATTAAAGAAGCAACA 
GCTTATGACAGAGGAAACTATATCTGTAAGGCTCAAAATAGTGTTGGTCATACACTGATTACTGTTCCA 
GTAATGATTGTAGCCTACCCTCCCCGAATTACAAATCGTCCACCCAGGAGTATTGTCACCAGGACAGGG 
GCAGCCTTTCAGCTCCACTGTGTGGCCTTGGGAGTTCCCAAGCCAGAAATCACATGGGAGATGCCTGAC 
CACTCCCTTCTCTCAACGGCAAGTAAAGAGAGGACACATGGAAGTGAGC^GCTTCACTTACAAGGTACC 
CTAGTCATTCAGAATCCCCAAACCTCCGATTCTGGGATATACAAATGCACAGCAAAGAACCCACTTGGT 
AGTGATTATGCAGCAACGTATATTCAAGTAATCTGA 
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Figure 14 



VMS cDNA 5 



CAGftJU^GTCCAGGMA-GGfiftA 



r&t_cD8flA. 

huni£a_5+3_corrected 
atus cDtSA 5 



husnan_5+3_correcte$ 
mas cDNA !5 



rat_cD&A 

humanJ5+3_corrected 



GACacanOT^GCAAa^ GA 



mis cDfc& 5 



rat_eD8A 
tau* ct)NA 5 



hux&anj5+3_correcte<5 
nus cENA 5 



CCCCCAGCT 



--AH^CKEAAGCCTaCK^^ 



amsjcSilA? 



AAG^CTmETTCCCT^^ 

iASAACGTTCCTTCAATCAGGTGUVJ^ 



rat_cDNA 
muB cP8Aj5 



3m3W^<5<3AAK»aM^^ 

ftT(5AAfiG3!AAM)S(^AC5AGGAA. 

TTA£AftG?*mmaAAA^^ 

*±* ** * ****** * * 



atatjcsBHA 

ht»a&_543_corrected 



^CAGCTGCTT<K^<3ArCTCG^ 

*** * ***** ** ***** * *** ********** * ********** *** ** 



ratjcDNA 

2iu»aiiJ5+3_corrftct«l 
urns cGKA 5 



************ ******** *** **** ** ***** ***** ********* *** 



rat_cDNA 

hu£&an_S+3_corrected 



rGACTTCCXTCCC?^CTGK^ 

**** <•*+*******♦ * ****** ************* * ******** ** *** 



humSq_5 +3_co rrect e<i 

sols dm 5 



*** * ******** **** ** *** ******** ** **************** 



rat_c0NA 



** ***** ************* * ** *************** * ****** ***** 



ratjCDSfc 
mus &K& 5 



rat_ei*& 

humn^5+3_correcte<i 



fcuaaan_5+ 3_co reacted 
was c5&h 5 



ratjeDKA 

hum&n_5+3_correote<3 

mis cDNft. 5 



ratjcDUA 

hcB»anJ5+3__correoted 
cDNA 5 



ratjoDta 

huraan_5+3_corrected 



rat^cOHA 

hns5n_5+3_oorreoted 
mud cEW. 5 



hufi5n_5+3_ Corrected 
mis cDNa 5 



rat_cDNA 
mas cDNA 5 



*********************** ******* ** ** *** ******** ** ** 
CTCGG<3AGCTTGGTCCGGTTO^a^G^ 

*** ******** ** ****** **** ******** ***** ** ** ***** *** 

* ***** ** ** * ** ****** **** ** * ******** * *** * # 
CTCC&CCCa«GA!FAC^^ 

***** ***** *********** ** ***** s*************^ ****„ 
*** *** * ***** ********** * ************ **** ********** 

***** ******* ***** ** ************************* ******** 

AMCTGOTCTCCraCTG^^ TTGTTTG 

************ ** ***** **** *** t ****.** * * 

* * * * * *** * ** *** * * * **** * 

-OCCC^AACCTCCCOCXXA CXTCCCT&OCTACCCATTC — CCA3TTTTTGGC 

** * * ** ***** * * * * ***** 

CCTOGCATTCCCC 

* * * ** 



rat_CDKA 
mus cSha 5 



httsan_5+3_TOrrected 
mis cM 5 



AAtaaAAATGACTAOtfO^^ 



rat_cW3A 

bman_5+3_corrfccted 
we cDNA 5 



ATXATAATCACM , CCAGCC&GTST^^ 
ATTACGGTOVCAITTCZySC^ 



hu&an_5+3_c»rrected 



human_5 +3_corrected 
IQUB CPNJV 5 



himan_5+ 3_corxecte<5 
mus CDNA 5 



rat^cDNA 



CAMACAGAKXSA^^ 



htmaa_5+3_corr«cted 



CAGTGMACACMATGGtfCn^^ 



ratjCONA 

huit£n_s+3_corrected 

mils cDNA 5 



rat_cDNA 

human_5+3 — corrected 



TGG&TTGGCTTCXASCT^^ 



retjcDNA 

huaiin_5-f3_corrected 
mas cDKA 5 



ra.t^joDNA 

humai^5+3_corrected 



G<^A^CAC^Graa3V«5CAGCAAT^ 



rat_Cl3WA 
«us cDHA 5 



CTGTGOTAGAGCCCtfATeGAGftAAGC^^ 
(^TGCTA6A&CC5rTTGGTC^^ 



sous cDNA 5 



TT(^C5AAAGACTTGA^rc^ 



rat_cDNA 

human_5+3jCOrrected 
mas ct&IA 5 



GGATTCTTCCW3G<^CACTG^ 
G<3GTTA!TTCCRGGAAAC&AT^^ 



fatwtan_5+3_corrected 
tm3 C^BV 5 



huaSn_5+3_correcte<J 



mas cBKA 5 



rat_CDfc& 

hiamSa_5+3 corrected 



Tr^CaTCTTMSGMCC^CXaGOTGC^ 



ar&tjcDNA 

hunan^5+ 3_co rr »c ted 
3aus cBN& 5 



A<»TTG<SA7^CAC&CCffC^ 



ratjeONA 

hmanj5+3_corrected 
was ct>N& 5 



XAtjetJKA 
sous cDNA 5 



huB^n^5+ ^corrected 

JCEUS 5 



a:at — ct&lA 
mis c5fc& 3 



CMACATACCTSGTGft?^^ 



r&tjCDNA 

husan_5+3_co rracted 
TOJS cDNA 5 



mis cStfA 5 



rat^cWaA 
JttUS ct>NA 5 



CACAAATACTACCACCTGaAG?^^ 



ratjcCSCA 
suae <£DKA 5 



CAG<XJA!r<3TC&AAGMTATAAACCCM 



mas cDHA 5 



AftAACCCSMCATTAIXJrTTCCSkTC AGI^«OT5?^TTCGaGATTC!W3-^ 



htata2!_5+3_oorrecte4 



.CftQGCA GGZAG&SCMCn^^ 



ra.t_cWh 



WAGR!TGHTCA&TGTCZU^1?M?GCX1A 



rat_cD*3& 

hunSaJS -t-3jcorrected 
xonB cDKA 5 



huc^ — 5+3jcorr acted 
WIS C&JA 5 



rat^cDKA 

huaSt^S+Sjcjorractad 



rat_cDHA 

ht^n_5+3_corrected 



»tjcDH& 

human_5+3__correet£d 



cms cDNA 5 



rat_cBN& 

fau»an_5+3joorr*ctfcd 

JKUS cDNA 5 



OTCTGTGCTGtCTTCCC?^ 



huroan_5+3_cozrected 
sua oDNA 5 



human_5+3jCOrrected 



h«wau_S+3_correctfed 
nojs cDN& 5 



rat_c&KA 

kuroaa_5+3_correeted 
wjs cDNA 5 



rat_cDHA 

human_5+3_corrected 
mas cDNA 5 



sois cDNft 5 



rat <?DNA GGa&fc&SrCTCT^^ 
human_5+3_corrected 0<52^^tCCXrrGGC^ 

S!U5jCatCl_5 — — — — <• 



rat^cDNA 

htnaan_5 f 3_correctfcd 
mis cONA 5 



hunSn_5 +3_corr ected 



JRJ5 cDNA 5 — - — — ™- *— — — — — — ^ — 



twsj&t>th]> ~ — — *■ — ™ — 



XluSjCSDtSAJS ' 



ams cDua 5 — ~~ — — . . ™™. 



rat_cDH& 

humaji_54-3_corxec ted. 
JQU* CDHA 5 



ratjoDNA 

human__5+3_co r r ected 



huaan_5+3_csorreot8d WPG^GAGCACft*OT<K^ 

mis CDNA 5 — 



ftuman^5+3_corrected 
mas cONA 5 



mU3 GtASh S 



fcuman^5+3_correcte<i 
sms cDiOV 5 



rat_cDNA 
reus cDNA 5 



ratjcDNA 



mis cDMA 5 



2ftu2 CDNA 5 



ratjcDNfc 
mas cDNA 5 



ratjSDNA 

huie5n_5+3_correct^d 
wets _cDtt& 5 



mas cSsft. 5 



rat^cDNA 



ratjcDNZV 

human_5+3_po treated 



ratjcBNft 



CTGTGGTTTGC13V!rCCTCCC2U302V!fCCTGG^ 



hm£Q_5 +3_corrected 



hu20ap._5^-3_corrected 
aus c5n& 5 



hurran_5i-3_correotefl 
hub cDKA 5. 



ratjorai 



GTTTAAAACTGCCCtGTACTGC^^ 



hmnin_5+3_corrected 

mas cDUA. 5 



rat_cDNA 

human_5+3_corr<3Cted 
mxs cE*JA 5 



CC&#3^CC3XiaG^ 



human_5+3_eor4rected 
xm$ cDNA 5 



3aoaan_S f 3 ^corrected 



TAczmamACTGcro^ 



ratjODNk 

htf&axi_5+3_cc> greeted 



Wtt0 CDN& 5 



GAT(XCTGTTTArr<SGA!rCAC^ 



rat_cD£S\ 



GWC2AAATGG<3<3^^ 



ratjcDHA 

human_5 f 3_co rrea ted 
©us cdn& 5 



mus CDNA 5 



fcuaan J 5+3_ corrects 
mus cDNA 5 



rat_jDtft3A 

humanj? +3_co r r ect 



fcumanjB+S^ corrected . 



rat_CtSNA 



anas CDttA 5 



CTGTCCraS&CTGTG&SOTC&CTG^ 



rat_CDN& 

toaan_5i-3j2orrecte<i 
mis cBfcJk li 



iivroan_5+3_corrected 
mas cDNft 5 



ratjODNk 

huaanj5+3 corrected 



hvon&o_5+3_ccrr«cted 



ratjCDNA 

hta^_5+3_corrected 
taus ctftGV 5 



huraan_5+3^corrected 
mas eDfcCV 5 



cms com 5 



ratjODNJl 

huwan_5+S_corrected 
JOU3 cDtd 5 



hus<Sn__5+3_correct«d 
mas cOfCV 5 



ratj3DNA 

hunSji_5+3_correated 



s^rrA^GRcc^CMrrccccCT 



hu&sm_5 *3_cor rected 



rat_c$N£ 

touaSa_5+3 — corrected 
mu5 ct)NA 5 



xat_cDK& 

huzftgtD_5+3_corroct^d 



taus o5sa s 



rat_cDNft 

human_5+3_corrected 
ttUS cDNA 5 



rat_cDNA 

htn&an_5+3_jDorrected 
mis cDi©. If 



humaajH3 w corxect€d 



himian_5+3^corrected 
ions cWh 5 



human_5+ 3_corrected 



rat_cBRft 
mas cSfcCV 5 



rat_cDNA 

hxaaan_5+3_cQxrected 
muts cDHA 5 



ATlCTGGGAtM&C&AATGC^^ 



mas cD£A 5 



AamX&GGTACTCT^^ 

AmrXa^SIWCTCTGAC^GAX A2AAIARAGT--CAACAA CATCXGGGCA 



rat_cPNA 

hui»n_5+3_corrected 



ratjcBNA 

human 54-3 corrected 
mas cDNA 5 



ATAC&TTTACACTAS'T^^ 
AIACATTTACA0JATTA2^T1ACA^ 



GCATTGTGA&CTGA& ACCGAGTCTdG — TGGftJCTCAAACCR^CTC^MCnM 

GCftTTAT©tfCTGftX<^^ 



kimaxi_5+3_corrected 



GGCfcCOTTIATTTTGOC^ AC&TTGAAACGGTXCUiCTAT 



iOT»an_5+3jcorrect©d 
mus c5na 5 



GAAAT?&C^CGGCTi^ 

AAAftTMC&AA5K3GCI?^^ TGAACTT-CTAAT 



huraaa^^eorrected 



rACd&ATGTCACaAAGATC^ 

-<xiagttgcctagtgiccmc^^ — 1 — A 



rat_ctm 

iman_5+3_corrfccted 

mis cBNA 5 



ATG<2*En<$AXKG<W^ 

AffiGAGAQJilGGMftAI^raAiiATCrGC — — AATCT — ATGTAIAftAIATTT 



ratjoDUA 

fcura>n_5+3_corrected 
IMA? ciDNA 5 



rat^cDNA 

hun5nJ5+3^corr«cted 

£813 CDBA 5 



A&AA 



(rat_cDNA: SEQ ID NO:7) 
(human_5+3 corrected: SEQ ID NO;8) 
(mus_cDNA_5: SEQ ID NO:9) 



Figure 15 



huaan_5+3_corrected KKTO^GITCIiVSE&VIC&VAS 
aouss 5 corrected KQKB3»mcm$t^<^V^ 



rat IU3ttjGYNSLmT^Dn*SI£K^^ 
hu»a©_5+3_c©rrect*d RINLGYN^VRI2iCTDF$GL^^ 
»ouse $ corrected RYNW^$imT^FSGI£^S3^^ 



rat BKDI*t<^SLV&IffiDH^ 
mou$6_$ corrected £KDT3itGIiRSX/rai£LDBmtf^ 



rat IEKTSFira£XSWaFWSI^^^ 

ttOuse"5 corrected IF3y?$FIKXLYLYT3NF-T5iPKE^ — 



ktfl^_$+3j5orifccted IKOOKDRSPSSftOQC^ 

mouse 5 corrected -™ — — ™ 



rat S^SPQDFZBITC&I^iaMX^GNX^ 

tama_6+3_co«eeted $afis*qgsh&pfg$lti^ 

2aouse_5_corrected — ~ — — — 

rat NLVCSVDYNHIQPWQLlkLYSD 

mouse 5 corrected — — 



rat DVFADFFtfFOa^Km^ 

«0\l9^J5jcorrftcted — — -« — ~ — — 

rat PK£ERTVI*VWTI*a^ 

fflouseJBjcorracted • — — — — — 

rat XDSFDS!G&m3I$mtfVD^ 

Ittoaaa^^cprrescted adsfdtgvtb^issnydd^iltyb^vvs^ 

nousft 5 corrected —.<——-, — - — i — — — 



rat FI^IWII«8!r?rSQmi^niK 

ftous« 5 corrected — ^-1—. 



r^t SVQKKGQ8MVEHDREAG<^ 
fctaaan_S+3jcsorrect€d SVKMKGQRPLEHIXSTEGSGIDES^ 

mouse 5 corrected — « — — r~ — ~ 



rat 

bucan_5+3_corxected 
jooiise 5 corrected 



rat * PIAVPLVELTDEEKD^GMIPPDS}^^ 

hroan_S+3_coi:rected PWTQLPNIPGEEDDSSQII^^ 

moused corrected — 



rat EVS-TVN?2TIiOSE3I^^ S7 

hum5ox_5+3_corrected £FSPVVNSQI£m£m>FKLSt?aiOT^^ 

sious e_5_co rrec te d — — — „. 

ret m*U^QAGR2^--S0$^VTG<3^ 

h m an ^ 5f3_correOted TEFQDSD£M<^GEi&F£SRPPITVKJM^ — LLESVMTTNSfi — Qt 

mouse~5 corrected - — — — 



ret 

human_,5+3_Gorrecte d 
Rou$e~5 corrected 



SV^SEPPHNHFYSBTTQn>CTSTreSt>PH^^ 



rat 

&uai^_5+3 w corrected 
lWttee_5jCorrected 



rat 

ht3man_5+3_correeted 
mouse 5 corrected 



TESRGRVl^PHRTPVLE&HBHB^^ 
IGGRC^ISPYRTBVI&KSRYSIFRS^ 



«AAI£ FPSAAP IT FFK^DI&RVPSEESTTL^^ — SVEKTTPTIKTFRTSI 



rat 

human^S+ajeorrected 
i&ouee 5 corrected 



NHVIPTE^SK¥SAPTSVS2<aK$PVD^^ 



«3« 
n 



btBt^_5+3_corrected 
mouse"~5 corrected 



h\aaan_5+3_co rrected TLSTSVMQIPSNTI<ttahhttt^ 

toouse 5 corrected — — - — — — — . 



aouse 5 corrected -— — — 



ret ' S^TTMmAESHSFRQDDG^ 

huzuan_5+3__corrected T*MTP— PAIAFTHSPFEffrr$I$STlSF^ 

ao use_5_co rr ected — ™- — — „ — — „. — « 

rat X^ySQ&STTVKRAS-DTP-Pm^S&^FV^ 

hwnaa_$+3_corrected ISS^HQSTTSEI^lJttPIPPn^SSSATIi^^ 

leous^S^corrected ~ — — : . , , _ 

*at VTIYESSRHNTDiQQP3AEA^ PN PEIITGTTDS PSt^P^S^SV^AI^VDKPQNSKRKPSF 

ilUXcan_5-f3_corrected VXL5SS$RBtf— I^KPSSQIEP I*TSSISOT.IJ5STPMmTTVKS®T$mPSP 

aouse^S^corrected — 



mous e_5_corr ected — ~- « 



Figure 16 



rat 

hufflan 5+3 corrected 



rat 

braan_5+3_corrected 



*********,** ^fc^**************** • # -*****^ *************** . 



rat 

auman 5+3 corrected 



EKDTFYGI£SLViUtfLDBN^ 
QKDl^GI£SLT i RLHMDHNNIEFIN^ 

******** ** a *** ************ t **** t ********* t **************** 



Si 

w 



rat 

human 5+3 corr^ct^d 



rat 

human 5+3 corrected 



rat 

htaman^5+3^correcte<i 



rat 

human 5+3 corrected 



+** **** t *•**********♦*******♦*»*** **** *******»**«*;*; X **; 

******** ** ^********** ^ : ^ic* *»*****„******* t 

** ***.*; ********** A**.*.************* 

*** : ** *******- ***********x* ****** ****** ********** 



rat 

human 5+3 corrected 



rat 

huzcan 5+3 corrected 



*;**** *- *;;* ********* { ****** 5 ************** % « ***** ** 



rat 

human 5+3 corrected 



***** .*.*****,* ************** *<■ **—*»•*** ******** ***• 



rat 

human 5+3 corrected 



PD^ISWVIPG*mVL¥GSSSU2C^ 

******** .*** # *. *.***...*******************;********,** *j* 



rat 

hwsu* 5+3 corrected 



** ; **** : *** *• ****^* < + - a :* s*** ::s: 



rat 

human 5+3 corrected 



HRI>LtHIlI^GI)STL5ilFR£KJU^IiPIiSARRl DPQRRAMiLRKAKKNSVPKKQENlTirKFY 
.*.* ******* *****{**•«* ********^***********. 1 * % *.***** # * 



rat 

human 5+3 corrected 



PIAVH,VELTDEEKD?£CmPPD£En*^^ 
PVVTQLP1UPG^DI>S8Q1^^ 

*;.- * : : - . ,***** *** # ;*.*: **** **,* ** 



rat 

hwtaa_$+3 corrected 



*.* .**.* * „**** : . : **;;*.:**;::*;*:.*****. :** 



rat 

human 54-3 corrected 



.*..** * *.* # ** * : * * . s . ** * * s*s.i* *:*: * 



rat 

huaan 5+3 corrected 



^ ? **^* m w * **** * * ; ^** -a ** : * * * *«**, * *.** *•***. *.^ 



rat 

human 5+3 corrected 



rat 

hujsan 5+3 corrected 



* ,***; ,*;*********« .*.; s*# ::*.»t » •** * :* • * **„* 
TA&LSVPSSSHSM^KTHHV^^ 



rat 

hiafflan 5+3 corrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



,.* ** * ** *; S ; •*** *;.*.. .U.* :* 

; *;**.*+ * J* ****;« ;* * : * $J, *** :* ** **** . * 

TI^TSVMQIPSNTLTXAHHTTTK^^ 



• • » * 



HI, 



.*-*** *** * * 



rat 

human 5+3 corrected 



STl^T-VTSP^g&SVIMSl^l&RSi^^ 

STQTAIEA!TTKTFP&S V iT If tTQITERSKAQT XQMQ£PQI3Ciil8.TDPtiIISFDQSSGF3DTPT 



**** *** **« 



rat 

hu9aan_5+3_corrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



.,***•*** ; ; * •**;;•** * m * ** . **** # * * ***r 



I^KSQESTTVStt^-DTP-PFtM 

SSKSEOSTTTEKASLDTPIPPFLSS SATl^PVPISPPFTQ^VTDTRGDSHBLMTNTW 

: ***,$j** *** «*^* ^****. .***;; * *n ;* ** 

TIYE$SttfMI!DLQQPS&EMPNPEIITGT^ 

KLHESSRHN— LOKPSSQIiEP LTSSTSNU^TJWALTrVKSQNSKLTPSPV 

* **+**: ****** . * **** **** 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

hitman 5+3 corrected 



rat 

human 5+3 corrected 



ret 

hujoen_5+3_corrected 



rat 

human 5+3 corrected 



rat 

huraan_5+3jcorrected 



AEYQmiKPYSDIAEBX33tfCT 

.****:**.,*•*: ***** 



*„**• ****** *** 



TSKHLPinrSIrPKmiKKmiGGK^F^ 
TSKLLPFDSLSmPEKPRlVS^^ 

*** **; :;:****j********:*****^********}*** ******* * : 

:*;*»*:**♦ ;************,*********** * -v.**;**;**** ** 4 ***.« 
..****** *********.** * ******* ******** j * j*** ; # *** *****. : *w 

* j ************* .************-*****. ** ; **^*** a * ************ 

QPSVHm mffiglJ gl^YggRm^PHGTL 
QPSVmTLSI^TE^m^FTNSKril^ 

****.*** **** ; #**± ; *;* : ;# ^ ******************* 

MLM^VTSPJUXJASOi^S^ G 

***;•* * ********* A***,*,********** j*** ; *.****4*** ; int * 



huzaan 5+3 corrected swiflTCPN^FlGSVTEKDSGV^^ 

— * — * ********* ^ J* *************:**** ***,*****•****;*** 

huaan 5+3 corrected I£i^FQVIX2<^GSFVFElSW^ 

— ~ *******************;****+***;***, ******•**;*****$*******•** 

human 5+3 corrected V^SGDYTCIS^TIKaCDeMKVHLT^ 

~ — ******* * ******************* ****** ** w **.**«**********;** 

ra t PNVF^PS&OTSFSHSi^EH^^ 

luraan 543 corrected PKimLPSKimSFSIDRCTFHMG^ 

— " * :: ******* :; **** **j***** «*.*;*** ft***.*****;******** **** : 

hvman 5+3 corrected ' VSKPPLI^YT^TVIiCATAVR^ 

*~ ~" ************** A******.*****;*****.* ****,****.***;***; 

human 5+3 corrected TVHKNGTraiJtfr/fcLSDSADFICV^^ 

~ *~ *** *****.**♦******** ** a * # *****************************; ;** 



rat 

huaan 5+3 



corrected 



r at CAftRi^<^EKDII3£IG^^ 

human 5+3 correctod c^FmOTEKLVTLElGQ^^ 

"* ~ *************.-************ ** ** # .***_**************-*** * # 

rat <3OTD£PCrra^ILHE*^^ 

human 5+3_COrrected GYVVP£PQItfGKmiHD^ 

"~ *j* 2 **** ;: ******«******* .«*•*•*****.«***** j j «*»* # ** .******* 

rat Iira^^OI^GE^OiaCVALGIPKPKVTra 

human_5+3 corrected Tt&PPRSIWRTGMTOTilOT^^ 

w * *** *♦********-***•**** * ***** 4 * } ^ :> * ** ** **** 

rat viomrrsoSGvm^imi^oxKLTiiQvt 

human 5+3 corrected VICfclPQTSDSGims^^ 

~" ™ **>* ********** ** : *** : *****j 



(rat: SEQ ID NO: 13) 
(human_5+3_corrected: SEQ ID NO: 14) 



Figure 17 



MQKRGREVSCmSLTAICLVVTPGSRVCPRRCACYVPTEVHCIFRDLTSffDGPANVER 
VNLG YNSLTRLTENDFSGI^RI^IXMLHSNGIHRVSDKTFS GLQSLQVLKMS YNKVQDE 
KDTL YGLRSLTRLHlJ)H]Nn^FINPEAFY GLTLLRLVHLEGNRLTKLHPDTFVSLS YLQIF 
KTSFKXLYLYDNITSLPKEMVSSMPNI^SLYLHGNPWTCDCHLKWI^EWMQGNP 
(SEQIDN0:15) 

Figure 18 

MKVKGRGrTCLLVSFAVICLVATPGG 

PJNLGYNSLVPJ.METDFSGLTKLELLMLHSNGIHTIPDKTFSDLQALQVLKMSYNKVRK 
LQKI)TFYG1J3.SLTRLHMDHN]^ 

QIFKISFIKFLYI^DNFLTSLPQEMSYMPDI1)SLLYLHGNPW 

KCKKI)RSPSSAQQCPLCMNPRTSKGKPIANlVSAAAFQCAKPTro 

AFISPQGFMAPFGSLTLNMTDQSGNEAN^^ 

VCNIDYGfflQPWQILALYSDSPLII^RSHLI^ETPQLYyKyKQVAPKPEDIFrNIEADLR 
ADPSWmQDQISLQLNRTATTFSTLQIQYSSDAQ 

I^HTVLVGGTVGLNCPGQGDPTPHVDWLLADGSKVRAPYVSBDGRILroKSGKLELQM 

ADSFDTGVYHCISSNYDDADILTYRITVVEPLVEAYQENGIHHTVFIGETLDLPCHSTGP 

DASISWVIPGNNVLYQSSRDKXVLNNGTmiLQVTPKDQGYYRCVAANPSGVDFLIFQV 

SVKMKGQRPIJEHDGETEGSGLDESNPIAHLKEPPGAQIJ^ 

NYRELTI^RRGDSTHRRFRFJNIRRHFPPSA 

SPPPVVTQLPNIPGEEDDSSGMLALHEEIMVPATKALNLPARTVTADSRTISDSPMTNrN 
YGTEFSP V VNS QJLPPEEFTDFKl^TAILTTAMSKNINPTMS S QIQGTTNQHS ST VFPLLLG 
ATEFQDSDQMGRGREHFQSRPPITWTMIKDVNVKML^ 
SEPRHNHFYSHTTQIIJSTSTFPSDPHTAAHSQ 
PYRTPVIJ^PvHRYSIFRSTTRGSSEKSTTAFSATVLN^ 

TFPKADIAR WSEESTTLVQNPLLLLENKPS VEKTTPTIK YFRTEIS Q VTPTG A VMT YAPT 
SIPMEKTHKVNASYPRVSSTNEAKRDSVITSSI^GAITKPPMTIlAITRFSRRKffWQQNFV 
NNH^KGRIIttlQHKVSLQKST^^ 

AHHTTTKTHNPGSLPTKKELPFPPLNPMLPSnSKDSSTKSnSTQTAIPAT^ 
TQTERSRAQTIQREQEPQKKNRTDPMSPDQSSGFTTPTAMTPPALAFTHSPPEim'GISST 
ISmSRTimTDVffiELAQASTQTLKSTIASFXIT^SKSHQSTTTRKASL^ 
I^VPISPPFTQRAVTDTRGDSHFRIMTNT^ 

HSTPMPALTTVKSQNSKLTPSPWAEQFWHKPYSDIAEKGKKPEVSMIATTGI^EATTLV 
SDWDGQKNTKKSDFDKKPVQEATTSKL^ 

IJCEAVGNPLPTIHWTRVSGI^l^RGNQNSRVQVLPNGTI^IQRVEIQDRFQYLCSASNL 
FGTDHLHVTLSVVSYPPRILERRTKEITVHSGSTVELKCRAEGRPSPTVTWILANQTVVSE 
SSQGSRQAVVTVDGTLVLHNLSIYDRGFYKCVASNPGGQDSLLVKIQVIAAPPVILEQRR 
QVIVGTWGESLKLPCTAKGTPQPSVWVL^DGTEVKPLQFTNSKl^FSNGTLYIRNLAS 



SDRFTYECIATSSTGSERRVVMLTMEERVTSPRIEAASQKRTEVNFGDKLLLNCSATGEP 

KPQMRLPSKAVVDQGSWfflYPNGSLHGWEKDSGVYLCVARNmGDDLILMHVSLR 

LKPAKIDHKQYFRKQVLHGKDFQVDCKASGSPWEISWSIPDGTMINNAMQADDSGHR 

TRRYRIJ^NGTLYFNKVGVAEEGDYTCYAQNTLGKDEMKVHLTVITAAPm 

IDIIKAGDTAAVLDCEVTGDPKPKIFWLL^^ 

EYVCVARhn > SGDDTKMYKIX>W 

SPEVMWIMPDMFLTAPYYGSRrrVHKNGT^^ 

VI£MLRRPTFRNPRNPFNEKIVAQLGKSTALNCVSVDGNPPPEIIWILPNGTRFSNGP 

QYLIASNGSraSKTTREDAGKYRCAARNKVGYIEKLVIIJEIGQKPVILTYAPGTVKGISGE 

SliJLHCVSDGIPKPNIKWTMPSGYVVDP^ 

QNSVGHTLITWVMIVAYPPRITNRPPRSIVTRTGAAFQIJICVALGWKPEITWEMPDHS 
LLSTASKERTHGSEQLHLQGTLVIQNPQTSDSGIYKCTAKNPLGSDYAATYIQVI (SEQ ID 
NO: 16) 



Figure 19 




Signal Peptide 
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TCTAGAA GTAAAATOATCCTGAGTA 

AATCATCrTCTCTGTGGTT^^ 

ACCTACOjAATAC^GAGCTATGACG^ 

(n^GCTGOTCAGAACATACTAGATG^ 

CTTAAGAATATGGTTGTCraTC^ 

GGTTAGTGCTAACTCTGACKKrr^ 

atcacntggtgattagcaactggatatggaatattact 

ctcagctttggcaqctccatix^txct 

actgtttctacttacaagacgcattac^ 

gaagagtcaattagacatattctoatgaagtaagcatataaagtoa 

tgttccatgtatgctc^tggatgctamtaato 

agcttatgatacttgtcacagagtaaatcttccataaatatc^ 

titcataatccatcmitaaaa^ 

ctgccctaggatggtctgtatggt^^ 

aggagcacatcttcagaacatctg<xtcaaaacat^ 

ac^cacatgtgcttcaaccac^gg^ 

ggaagcctgctagaggaacaggatgtatcagga 

ataaccctcaccacaagaaacaac^ 

gaactaagccatatccagaaaaagtattatatottctcat^^ 

agtcaaatatccagaatatagacaaagatgtf 

cttgtagattagaggacacaaagcaacaactacagagtgaagtaatccagagacttaatc 

TATAATATGAGGACTGTATTTAATAATTCT^^ 
ACTAACACACACACTTACATAGAGAGAATAMGTGA 
TAGCTGATAATrTCATATTGTACACXnXIAAA 
AQGTGCCTCTCXX^CKJQCGGAAGAGTACATTCQAM 

ccacatgoaggagctagagaaagtagccaaggagctaaagggatct^ 

tggaacaacattatgagctaac^ 

aaaagatggcctaatcggccatca^^ 

atgccccagtacaggggaatao:agggcc^aaaaggg<k^ 

g<k3gtgggtogatatgggggactrtixjgtata 

ctaataaaaaatogaaaaaaaaaaaaaaaaaaaaaaaaaggaaggt^ 

ttcactgctatctcaacitgcaaacagaaggggagtca 

gattgaagcgtctitgmtot^ 

TGTGAAAACTTAOKKKjTCT^^ 

ATATTTGGCACCACTACCAGCCTC^ 

CACAC&CACAt^CACACACACAC^C^^ 

(^TCATCTGCTCACTC^CCCAC^ 

CTATTTTrATGTCTAGGITCXXXXri^ 

AGAATTTGTAGC(XCTGAGCCAGCCCGGGCCACAGAGa^ 

CAG<KKnTACCAGCACAGGAGGAGCAAACTCGCAGGCGA 

CCCGGGTCX3ATGGCCCGCCCATTCCCAGAAGCCGAGGCT 

OCTCCTaXGAGlXj^ 

TTGGCC£AAGG<^GAGGTGAG<XKK^ 

AGTCTCKITAGATOIXKKMGCTGGGGTTCA^ 
GGlXXlAGGAAAAGGAAAGTACrGGAGGGGAGTlXKKiACA 

c^tcgcttcagtqactgaagccag<k:aaaaggag^ 

0>CTTTCATAAAC^CTGATO^^ 

GGGACGTCGG AA AGAAGGAGTG ATCGATTAGTACTTQTA A GTTTAGGTCAGTTTGAGAAG 

TAACTAACCTATACTATTGAGGGAGAAGGAAGAGCAT^ 

GCAATCAGATAAAOTAAAGCTTr^^ 

OAA(KX3CCTCCAGTTCTC^ 

OCCTTTTGCTCCCGTIt^^ 



AGATTTCCTQTCTCTGCITATGTT^ 

TATTGCCAACIOTTTGTT^ 

TTCATITACCKKJIXncrTCTXOT^ 

CAGCTACCTTGTA<XATCTTTGTITATCTAATAA 

ACGCAGAGAGTATTCAAAAATATTCAG^^ 

ATCATTAGTAAATCTTAATTTGGGTT^ 

CACKjAGCCTrCGCCACGTCKXSGGra 

CTIGGCCATGCTCCCCAT^ 

AGCAGACAaTGmTGCCTGCAAAGTCAGAGAC^ 

GIXXrTCCCGGTCATTAGTTGGCT^ 

AGACATGCTAGAAAGAAATTTGACCITnTrrcrAGTG 

TTTGGAAGGTTAAGCCAAATAAATAAMCACATATTAAATAATAC^^ 

GATCATATAAAGAAGTACATTCATAAATGCAATC 

TTACTGGTCKIAAAGTTTTCTAAATTGCA^ 

AAAATGTATATAAAAGTGTAAAAGGCTCTTGGTCA^ 

^r(BastH2) 

TTAATTXmXWATAACATTATTATGATCTCT 

AATGAGAAGCXATGAAGATATGTTTACMTAAT^ 

GGAAGCTAGCACACACnXK5TTTATAAAGAGA^ 

CTCTAGGGAATAAGAGTIXX7TCTTTC 

TTOTX3ATGATQ^G<^TTAT^ 

ATCKX^CTGAACAATCnTTCACW 

TTGGGGGATGGTGTTAATTTITITGCAGTTCOT 

AAACAAACAAAAAACCTCTGAAACTAGAACTACCAATCC^ 

AGAAATCTGCAC^GAATTTATTXKrrACATTO 

ACCAACTTACGTAG<XXjTC^AAATATGAACC^ 

AGAGTCCC^TGTGGCCATAAAAGAGTGAAATCATGACAT^^ 

GGAAATCAATTGGGCTAATCAAM^ 

CTCTOACAAACAGAAGCTAGATTTACAOT 

TAGTTATACACTATTCTAATCTGTGAGTGTGTATAAAGGCA 

AGCTAGCTGGGGTGGGTAGGAGAGAMGCAATGAGAGGAGTTAATAAGAAMAAGCA 

GTAACATAGGTG<XAGGATGAAATGCATTAATTTGTATGCT 

GGCACACX^TTCAAACCAQGGTQAAATCCCAGCACAGAGAAGGGG 

TTCGCCACTAA(XAAGAAGCCATITGCAGTTGCT 

CTCX^GTCTGACACTGTGTAT^ 

rTTTTGTGCTATTTrrCATTTrr^^ 

GCTGTGGTTTTCATTTGTTTCTIT^ 

^GCTGGAAACGATCTGGAAGAAGTTGGGGAAAGAGAAAAATT^ 

iCAAACAAACAAACAAACAAAAGGTT^ 

rAGTTACmCTCTTAAAGAAAATATTCCC^ 

ITAGGACTTTGCTTOAACTGGC^ Artificial 

Nod lite 

(SEQIDNO:17) 



I 



Figure 48 



CACAGACCTnxnxnTCT^ 

CAGCAAGCACACICTGCTAAT(X!AAGGG 

G<^GACCCACACXOTT<XAACTGCTC^ 

GAAACCTQCTQTQATCTCXJATTrCTTCTCT 

TTXXnX^UACClTrCCTCl^ 

TCATTGGGAACCCTQTOCTCCATCC^ 

CTTOOTCAGTGCAGTCTTITGTA1XXTO 

AACTGTTITrTGTTATGATTGCTGCAATCT 

TCCTTTGIXnxXTCCTn^ 

ACCAAAQTGATATAGAITCATGGCATTOT^ 

ACTTGGCAGGTGTAAMGCCTGGAAGCAGTC^^ 

CTGGAACCTTGGCATTCCCATAGCTAGAATGC^ 

GGTCTTGCATGCmCTTCK^ 

TAGCAAAAGTATACXnAGAATCTT^ 

GAAGAAAGTTGGTAAATTCITCTGACAAAT^ 

AGCTAAGTCACACTCATATGCAAGAATTTACCAAGGCCTAG^ 

GCTTTATTCATGATAAGGTCIGCT^^ 

AAAGGTGTAGCATAAATTGTOACTAGAGTGTGAAATCm 

GGCAGAGCroTTTTATCTITACTGTAMCATTAW 

TanX^TTTTGTGACAGATACTTCT 

TITCTGTATTATATAAGCCTGGTGTT^ 

CCTTGTGTCTGTGTCCTTTTGTCATTTC 

GTCTTrTCCACAGATTGAGG^^ 

GCTC&CTGTTGGTCACTACC^^ 

AAGTGTGGGTTCAGGAAGCAGGTKnACACXX^ 
AAATAAGTTAAOXATCTCTTamXj 
TGCTXX3TGGTGTGTCTCXATCAGAGGGCAGAG 
TGTCCATTGTlTCrrTCAG 

AATCnXKKjAATGACTTITCrAAGAAATGCCACGTC 

TAAACAAGATATATGGTGCGCAGTITATAATCATAATAAGCTTTGAAATAATATATA^ 

CATTCTCATTCTGCTTCCA03^ 

AAAGTAGTGATTTATGACCTGCirTGTTCTGTTACT 

ATGTTGAAACATGGAACTIXSGGGTGACCT 

TTTGTCTCCAGAATAAATC^GTTTATCAA 

CGCCCX3TGGAGTCCCACCATTCTACTO 

ATGTGATTCTTCCAACTCCAAAT^ 

ACTAAGTCTATTGATITGGAAGATCTCAGTCK} 

TCAGGCTAAGCTOAAAAATATGACAAAl^^ 

AGTGmGGGGGnTTAAAGAAACAAATACCTACTCTAMTA^ 

CTATTCTAGTITTGAATAATG^ 

TTATCTTGACCAATCTC&GTGACTTGAT^ 

AGGTCTC^TTCAACCACATGTTTATC^TTTGGGATCT 

ATTCTlXXAGAGCACim^^ 

TGTGC^AAACAGAGAGmCAAGACCnTTTATOTM 

ItSAC^GAATGTCAGAGTGGAGCIXX^ 

TTGGAACATCAC^CAAAGAAATTCCAAATTATO 

ACTTTTCACrCrrGAACATGAGCTT^ 

GTGAAGGAAAGCTCGCCAACTCCTTrrTATATGTAACA 

GACAGTATGCC^GTCCCAAGATC^^ 

AAGGCAGGAGAAGGCAAGCTACAGTGAAQCCCAGTTC^C^^ 

AGATAAGAAGGGTTTCTAATTTTTAAA 1 11 1 1 1 1 lATTAGATATTTTCTTCATITACATTTC 

AAATGCTATCTCAAAAGTCXX!^^ 

CCATTTttTGG<XCTGGTTAOT 

ACTATCTGGTCTATIGCAAGMTGGCTGCCGAGCTO 

AGCC^ACAGTGGAAGGAACTTGGGGACIXOT 



<XmXK3ATGG<XHAAG<3AACTCX^^ 

GCCTCTCCACICATATGTMCAG 

GATCKXXKJTTAGGGGTGGXKMTGGGGGmTCTC^ 

TCTTOTACimCKriXKXJrAGTCTGGOC^ 

GTAAAGACTTGAACTTCTGAGGTGTCGGTGGAGGGTAT^^ 

TAGGAAGAGAA0AGGA<KKKK3AAGACT1TKK3<^^ 

ATGAATAAGAAAAAAAAAATAAAAATAATTAAAAAAAAAAAAGAA 

TACTtinAAAACCATTGCATCCXXXXJCOCCCAATCAT^ 

GATACTAACAATAGATGTAGGAAACTATTGlTAACTIUr^^ 

GAACCTTmTAn"GAAGCTrrCACA<^GAGCXrrrG^ 

TAGCTTAATGATGTTCAGTGAArrAAAAATMCA^ 

AAGXIATTCXreGTATATATTTAAAAA 

TTTCrATATATGTITACTCrATOGTATGTTAATTAA 

CAACATAGTATGTAACTTATAAGAAAGrAAAAC^TTC^TGAAATTGTGAAGG 

CTTACra^TCAGAAACACTCGGTlTGAATAATTCT 

AGAAIXjTAATAACCTACTAACXK^AACATACAAG^ 

GAAGTAATAAGGCAAATAATGAATATTTGCITrAAATAGTTOT 

CXKJTGTGATTCTXXJAQACTTQACTCCATCCA^ 

TTQAMCAATX^TTGAAAATMTAAA(^rrTCX}TACT^ 

TTGCITrrrAlTaCTGTOATAAGGACCATGACX^AAAGCAGCTTA^ 

(SEOIDNO.18). 
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TAAGCCTTTTGCTCCCGTTGGAAGCAAAGAACGTTCOTCAATCAGGTGAAGGCTCT 

CCTCAGAAGATTTCATGTCTCAGCTTATGTTACAAGAGGATTCAAAAGCAAGACAGA 

AGAGCTCAGGTATAGCCAACTCTTTTGTTAAATACAGTATGAGGCTTAAGTGTACGG 

CAACTCATGTGGTATTCATTTGCGGCTCTCTTCTCTTATAACTAACTCTTAAGGTGCA 

TATAGTCTCTTCTGTTTCCCAGCT 

AGCTCATCTGCTTTTTAATCATCACGCAGAGAGTATTCAAAAATATTCAGTGATGTA 
ACAGTGACAGTGTAGGCATAGAAGTAATCATTAGTAAATCTTAATATGGGTTAAACT 
CATTCATAACAGCTCCAGGTTGG (SEQ ID NO: 19) 



TTTGGAACCAACCCAGATGCCCCTCAACAGAGAAATGGGCCAGAAAATGTGGTCCA 

TTTATCCAATGGAATACTACTCAACTTATTAAAAACAACGACTTTCATAAAATim 

GGCAAATGNATGGTCTGNAGGATCTTGAGTGAGGTAACCCAATCACAAAAGAACAC 

TCATGGTATGCACTCACTGATAAGTGGCTATTTGTCTATGGAGTGATTTAAAAGGGA 

AGAAGACACATAGCTTTTTGTGTGTATAATATTAAGATGGAAATTTGCCAGTGCTGT 

TTGGCTTATGAGTGAATCTTGTTTCAGTGGATTACCGGAAGAAAATAATAAGTGAAC 

TGTAGGAAGAAGTAGTTAATCAAGGTGACAAAGTATCCTGACACATTGGGAAAAGA 

CCACAGTCCAGGAAACTGAGTCTTAAGGATTCATATTAACTCCAGTTCCCCATGTGC 

AGCTCTGAGACTTTGGCAGATCAGACACTTAACTTCACCAGCTTCCTACACAGAGCA 

GTTACTATCCTTGCCTTCACACATGGAGTGTGCCATTAAGTGCCTGAACATGAGTCT 

GACTTGTTAATAATCTTTAAAATCCAATTGTGTGTAAAGTATGTGACCAAAGAGCAT 

GGTCATGCTATTAACCTTTGATGTTCTATGGACTCTTAATTTTATGGTAGAAATGTCA 

ACAAGCTTGTGGAGGCTGGAAGATACAAGGCTTAAGAGGATGGCCTTTCAGTTTTG 

AAAGTAATTCAGTATGTGTTCTGGCATCCCTTTTCCTAAAGCAATTTAACCCCCCAA 

GTAGGCATAATTTTAATGCTTACTTCATCAGAATATATCTAATTGACTCTTCTAAAAA 

GACTTTGGTATGCATAGGATCTAAATGTAAATGTGATTTACTGACATAATAAATAGG 

AGAAACTGAGCTAGAATAGGTATAAAATATGTGCTGGCTTTCTAATAGGTCTTATAG 

GTTATATAAGAGGTGGGAAAGGAATATTTGAAACATCTAGAAGTAAAATGATCCTG 

AGTAGCGATCCTGGGAAAATACGTACTCTAACACACTGCAATCATCTCTCTGTGGTT 

TGCTGGAGCTGAGGTCTGGAAGGCTCGACCTTGGTTAGAAATAACCTACCGAATAC 

AGAGCTATGACGTTAGTCTGGAAGGAGCTTTGGAAGAATGACAAGCTGTAGCTGCC 

CAGAACATACTAGATGCCATATTTCCAAGGCAAGTGTCCACATGCGGACATCTTAAG 

AATATGGTTGTCTCTGCAGTGCTAAGGACCTTGTTCGTGCCACACAGGTCTCCAGGG 

TTAGTGCTAACTCTGACTGCTTGACTCTTTAATTCTCCCTTGATCATTAATGACTAGA 

AATCACTTGGTGATTAGCAACTGGATATGGAATATTACTrAATTTGTACCCAAGCCA 

GGCCACCTCAGCTTTGGCAGCTCCATTCATTCTGTGGAGCCCAGTCACGTGGGTTTG 

AATCAACTGTACTGTTTCTACTTACAAGACGCATTACCTGAGATGAGTCATTTTTCTT 

CACAAGTCTTTTTAGAAGAGTCAATTAGACATATTCTGATGAAGTAAGCATATAAAG 



TGAGAGCAGCATGAATGTGTTCCATGTATGCTCATGGATGCTATTATAATGTGGAAA 

TAAACTGACTITAAAAAAAAAAGCTTATGATACTrGTCACAGAGTAAATCTTCCATA 

AATATCATCTGCATTTATAAATTATTITCATAATCCATCAATTAAAAACCTTTAGAAA 

TTTTGTTAACACAAAGATCCCTAGGCCCCTGCCCTAGGATGGTCTGTATGGTGGGCC 

TGAGAGATGGAGCTTAAGAACITACTTGCTCCAGGAGCACATCTTCAGAACATCTGC 

CTCAAAACATTTATCCCAAATGCTCATCAAAGGCTCACTCACATGTGCTTCAACCAC 

AGGGATTAAACAGTCATTTTAGTCACATTTCTCAAACGGTGGAAGCCTGCTAGAGGA 

ACAGGATGTATCAGGATAACATCCAACCTTACAAAAGGATGTCATAACCCTCACCA 

CAACAAACAACAACGACAACAAACCCATAAAAATTATCACGGCAAATGAACTAAGC 

CATATGCAGAAAAAGTATTATATGTTCTCATTGTGGGGTGTTTTTCCTTAATAGTCAA 

ATATGCAGAATATAGACAAAGATGGTTTATGCAAGTGGGGATGGCGAAGGATACTT 

GTAGATTAGAGGACACAAAGCAACAACTACAGAGTGAAGTAATCGAGAGACTTAAT 

GTATAATATGAGGACTGTATTTAATAATTCTATTTAAGATACACAGCAAACGAGTGT 

ATCTTACTAACACACACACTTACATAGAGAGAATAAAGTGATAGATACGTTTGTTTT 

ATCTTCATGTAGCTGATAATTTCATATTGTACACCTCAAACATAGATAACCAACAAA 

GAGGAAGAGGATAGGTGCCTCTCCCAGGGCGGAAGAGTACATTCGAAAGTCAGACA 

CCATTGTGTAGATGTACCACATGGAGGAGCTAGAGAAAGTAGCCAAGGAGCTAAAG 

GGATCTGCAACCCTATAGGTGGAACAACATTATGAGCTAACCAGTACCCCGGAGCT 

CTTGACTCTAGCTGCATATATATCAAAAGATGGCCTAATCGGCCATCACTGGAAAGA 

GAGGCCATTGGACTTGCAAACTTTATATGCCCCAGTACAGGGGAATACCAGGGCCA 

AAAAGGGGGAGTGGGTGGGCAGGGGAGTGGGGGTGGGTGGATATGGGGGACTTTT 

GGTATAGCATTGGAAATGTAAATGAGTTAAATACCTAATAAAAAATGGAAAAAAAA 

AAAAAAAAAAAAAAAAAGGAAGGTCAGACACCTCACTTCACTGCTATCTCAACTTG 

CAAACAGAAGGGGAGTCACAAACCCAGGACAAACCACAGTGATTGAAGCGTCTTTG 

AATGTTATTGCTGTTGTTGTTACCACCATCATTAGCATATATTCATTGTGAAAACTTA 

CGGGGTCTATGACATGTTTTTTTATTCAAGTATATCACATGCTGTCAGCATATTTGGC 

ACCACTACCAGCCCCAGCCCCCTTTGCCCCGCCCCCAACACACACACACACACACAC 

ACACACACACACACACACACACACACACACACACACACCTTTACCTTCTCCTGGGCA 

TCATCTGCTCACTCACCCACCCAAGCTTAATCCTTTTCCTTCCCTGCAATAGTACCTC 

TCCTATTTTTATGTCTAGGTTCCCCCTCCCCCTGTTAGGAGATGGGAGAGGTCACGA 

AAGAAAGGAATTTGTAGCCCTTGAGCCAGCCCGGGCCACAGAGCCTGCCACCAGAC 

AGGAAAAGCCCAGGGCTTACCAGCACAGGAGGAGCAAACTCGCAGGCGAGCCTGG 

GTTGGCGCTGGTGGTCCCGGGTCGATGGCCCGCCCATTCCCAGAAGCCGAGGCTATA 

GCTGCGTCACCTGCCCCGCCCTCCTCCCGAGTGAAGACCCCTAGAGGCTGAGCAGAC 

CCCAAAGGCGGTGCAATTCCATTGGCCCAAGGCAGAGGTGAGCGGCTGCTAATCCC 

CTCGGGAAGTGAAGGGACCCAGAGAGTCTGGTAGATGTGGGAGCTGGGGTTCAGGG 

CGAGACAGAGGGTGGGATGGGCAGAAGGGTCCAGGAAAGGAAAGTACTGGAGGGG 

AGTTGGGACAAAAGCAGCGACCAAGGGAACATCGCTTCAGTGACTGAAGCCAGGCA 

AAAGGAGCGGGAAGGATTATATGTAGCCTGGGACGCTTTCATAAACACTGATGACG 

TGTTTGTGCAAAGCAAGCAATTTGAGGAGAAACGCCTGGGACGTCGGAAAGAAGGA 



GTGATCGATTAGTACTTGTAAGTTTAGGTGAGTTTGAGAACTAACTAACCTATACTA 

TTGAGGGAGAAGGAAGAGCATTCCAGCAGCAGCAGCAGCAGCAGCAATCAGATAA 

AGGAAAGCTTTGKjTTAGTTTGGAAATGTATGATACCATTAAAATAACAGAAGCGCCT 

CCAGTTCTCTGAAGAGTCAGTCCCCCAGCTAGTGAAGACTAAGCCTACTAAGCCTTT 

TGCTCCCGTTGGAAGCAAAGAACGTTCCTTCAATCAGGTGAAGGCTCTCCTCAGAAG 

ATTTCCTGTCTCTGCTTATGTTACAAGAGGATTCAAAAGCAAGACAGAAGAGCTCAG 

GTATTGCCAACTCTITrGTrAAATACAGTTTGAGGCTrAAGTGTACGGGAACT 

TGGTATTCATITACGGCTCTCTTCTCTTATAACTAACTCTTAAGGTGCATATAGTCTC 

TTCTGTTTCCCAGCTACCTTGTACCATCTTTGTTrATCTAATAATAGCAAGCTCATCT 

GCTTTTTAATCATCACGCAGAGAGTATTCAAAAATATTCAGTGATGTAACAGTGACA 

GTGTAGGCATAGAAGTAATCATTAGTAAATCTTAATTTGGGTTAAACTCATTCATAA 

CAGCTCCAGGTTGGGAGGGATCACTGAGCCTTCGCCACGTGCGGGTTAAAGATATTT 

TCTAACAAGAGAAGCAGAATTCTTCCTTGGCCATGCTCCCCATCACTGTGTCAGTAA 

GCAGAGGGGTGTTTCCAAGCAGAGAAAGAGCAGACAGTGTTATGCCTGCAAAGTCA 

GAGACTCAGCCCTCCCAGCTGGTCAGTITACTGTCCTCCCGGTCATTAGTTGGCTCTG 

AAAAGGCCCATGTGTCCTTATTGGCAAGGACTTGCAGACATGCTAGAAAGAAATTT 

GACCTTTTTTTCTAGTGGGTTATTACAGCTGTAAAAGTATTTTGGAAGGTTA 

AATAAATAAAACACATATTAAATAATACAATGTTACAAAAATTGATCATATAAAGA 

AGTACATTCATAAATGCAATGTGAAAAATATATATAATTTTTATCTATTTACTGGTGC 

AAAGTTTTCTAAATTGCACATGTACTATTT^ 

TATATAAAAGTGTAAAAGGCTCITGGTCAAACAAGAGAGTrAAATTTACAMCTTTA 

ATTGTCCCGATAACATTATTATGATCTCTAATGACAGGGATCCTGCnTTTCATTGGGA 

AATGAGAAGCTATGAAGATATGTTTACAATAATAAGCCCATTTAGTGATAAAGTCCA 

ATGGGAAGCTAGCACACACTGGTTTATAAAGAGAACAGTTTCCTGAGTCTATGCAA 

GTTTACACTCTAGGGAATAAGAGTTCCTCTTTCTCCAGATTTCACTAGCATTT 

CATCATTTATCTTCTTGATGATGAGCATTATAAGTGGAATAAGATAGGATCTCAAAG 

GAATGTCAATTTGGATGCCCTGAACAATCITTCAGGTCTTTCTTTCAGTTCACTAGTC 

TATTCATTTATTGGATAATTGGGGGGATGGTGGTAATITrTTTGCAGTrCTrATGGAA 

TTCCAAAAAACAAAAAACAAACCAACCAACCAAAAACCTCTGAAACTAGAACTACC 

AATCCATTACTGGGTATGTAACAAAGAGAAATCTGCACAGAATTTATTGCTACATTG 

TTCATTATTCACGACAGCCAAGAATGTGGAACCAACTTACGTAGCCGTCAAAATATG 

AACGGATAAAGAAAATGTGGAAATGTGTACAACAGAGTCCCATGTGGCCATAAAAG 

AGTGAAATCATGACATATGCAGGAAATGGATGCAACTGGAAATCAATTGGGCTAAT 

CAAAACAAGACAGACTCAAAAAGGAAACACCGTGTAGCTTCTCTGACAAACAGAAG 

CTAGATTTACACnTGTACGTGCGCATGTGTGTTTAGAATTTTATTTAG 

TTCTAATCTGTGAGTGTGTATAAAGGCATGCATGTAAAGCAAAAACAAGCTAGCTG 

GGGTGGGTAGGAGAGAAAGCAATGAGAGGAGTTAATAAGAACGAAGCATAGTAAC 

ATAGGTGCCAGGATGAAATGCATTAATTTGTATGCTAACTAAACCACAGACAGGAG 

GCACACGTTCAAACCAGGGTGAAATCCCAGCACAGAGAAGGGGAAGTAGACACAA 

AGTTTCGCCACTAACCAAGAAGCCATTTGCAGTTGCTGCCTGCTGGGAAGGGGCGTT 



CCAGTTTTCTCCAGTCTGACACTGTGTATAACAACCAGTTGACAATACAAAGTTGGC 
ATGATGGATGGTTTTTGTGCTATTTTTCATTTTTTTTOT 

GGTTGTTGTGGTGGTGGCTGTGGTTTTCATTTGTTTCTTTTGAGAGAGAGAAGGAAC 

ATGAAATTGGGTGGGTAGGAAGCTGGAAACGATCTGGAAGAAGTTGGGGAAAGAG 

AAAAATTGTATGGAGCATATTTAAACAAACAAACAAACAAACAAAAGGTTCATTTT 

GCCACAAAAAGGTGTGAATTAAATTAACCAGTTACGACTCTTAAAGAAAATATTCCC 

AATTATTCCCAGAGTTGCTATGTATGCTGTGCCTAGGA(nTrGClTGAACTGGCCCTA 

TAACTCTGGTGTGGTGTCTTTTCAGGATGCAGAAGAGAGGCAGGGAAGTCAGCTGCT 

TGCTGATCTCCCTCACTGCCATCTGCCTGGTGGTCACCCCTGGGAGCAGGGTCTGTC 

CTCGCCGATGTGCCTGCTATGTGCCCACAGAGGTGCACTGTACATTTCGGGACCTGA 

CCTCCATCCCAGACGGGCATCCCAGCCAATGTGGAACGAGTCAATTTAGGGTGTGTG 

GACCTTGCCTGATCTCCTTCTCAGAGAGGGACCACTGATTTTCCTGGTACTTTGCCCC 

CCAAACACCTGTGATTACTTTTAATAGTTTTCTTCTAAAATGGGTTCATACAAACCTT 

ATATTGTGGAGACAATGAACATTTTATCCCAATAGTCTTTTACTAGAACTTGAAGCC 

CCTCTTAGTTGTTTGGGAGCCTCATAATTATGGGGCAGCTTTATTCTGAATGAATTTT 

AAATGAAAAAGATACAGTTTCTGTTAACAATCATTATGATACCAAGGAAGAGGAAT 

TGTCATTGAATATTTTAAAAAAGCATTTCTI^ 

ATGGCTTACITAAAATACTTGCCTTACTAAATCTGACAAATTATGGTGATATTTTGAA 

GGTTTATGAAAATTTGTTTATGTGTATAAATGCACAAGAAATGGGATATGCCATCAC 

CTATGTGCCATTAGTGAGCATGTACAGTATGCCAAACACTATTGTTCACGTTTGGAG 

GAAGTAATGGGGGTGGGGGAGCAACAAGGGTTATAACCGTATACCCAGTGCCTTGG 

AAGCGATTGCAAACAGTAAAGACTGACATTGTGTTCTCCCTATGAGGGAGGGGCCTT 

GGGCTGAGCACTTTGCAATGAGCATTTGCTCATTGTGCTGGCAGGTTTTATGATAAC 

TTGACCCAAGCTAGAGTCACTGGAGAGGAAGGAACTTCAACTGAGAACATGCCTGA 

AGAAGATCAGATTATAGGCAGGCCTGTGGGGCATTTTCTTAATTAGTGATTCATGGG 

GCAGGGCCCAGTCCATTGTTCGTGGTACCATTTCTCAGGCACTATTAAAAAAAAAAA 

AACAGGCTGAGCAAGTGTCAAGGAGCAAGTCAGTGAGCAGCAGCCCTAATGATCTC 

TGCATCAGCTCCTGCCrCCAGGTTCCTACCCTATTTGAGTTCCTGTCCTAGCTCCCTA 

CAGTGATGAACAATGATGTGGAAGTATAAGCCAAATAAATCCTTTCTTCCCCAACTT 

GCTGTTGGTCATGATGTTTCATCACAGTGATAATAGTCCTCATGAAGATGCTGGTGT 

TTATAACACCTTTGGACTAAATTCTGTTATCTATAGCTGAGGAAAATGGAGCATAGA 

AAGTCTCCAGACTACACCAGAGTGTAATCTGGGCCTGAGCTTAGAATCACACCCAC 

GTGCACTCCACTGCCGGGGCTTCTTAACCGGAACACAGTTGTAAAAGGGAATTTTCT 

GTTTGTTTCCATTTTGACATGTGGACTITAATrGACGATTCATCTGAAGCTGAAAATG 

Alillill iCCAGGTATAACAGCCTCACTAGATTGACAGAAAATGACTTTTCTG 

GAGCAGACTGGAGTTACTCATGCTGCACAGCAATGGCATTCACAGAGTGAGTGACA 

AGACCTTCTCGGGCTTGCAGTCCTTGCAGGTGAGATAGGTAGAGGGTGATGGAGGC 

TGAGAAGAGAGGTGCAACTGTGGGTTATACCCAAAAGCTGCTGATTCCCGTGGGAG 

ACATTCTATAAGCATTCTATAAACTAGAGGCAGATATCAAGGAAGGATTTCAATTGT 

AATGCAATTTTATGAGAAAATTTGAATATTAAGAAAATGCTGGGGAAAATGCTTAC 



ACAATTGCGAGGACCTAATTTAGGATCTCCAATAGCCACATAAAAAGCACAGCATG 

GCGGCAGACACCTGCAATTCCTGTCCCTGGAAGCACCTGTTCAGAATCCCAGAGACT 

CATTGGCCAAACACTCTATTCAATCAATGAAGTCCATATTCAGTGACAAAACTTGAC 

TCAGAAACTAATGTGGAAAGCATCAGGAAGACAGCCAACATCTGGTCTCTACTCAT 

GCATGAATAAGGGATCCCAGAGAGAAGGGAAGAAAAAGGAAGGAAGGAAGGAAG 

GAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGAGAGGGAGGAAAGGAGGG 

AGGGAAGGAAGGAAAGGGAAAGGAAAAAAGAGATGGGGAGGGAGGGAAGGAAAG 

GAAAGGGGGAGAAAGAAGAGAAGAAAGGAAAATAAATAAATTTTCAGGGATTATT 

ACACCTTTAAATTTTATCCATAAAAGGTCATTTCCACCTGTTTGTCTGGAAGTAGAGT 

GGGATCCCTTATATAAGGGCAGT(mTAACATAGTAGCATTTTATAAACCATTACAA 

ATTTTGAGTTTTCTCTACTTTTTATCCTCTACCATCTTCAAA 

CCCACAAATGAAGAAAATGCTGTAAGAGTTTTCACACACCGAAGTGGGAAACTTAA 

GGATTAGACAAGTCTAACAATGAGAATGGGGAGAACAAAAAGAGACTGCACAGGG 

AGCCCTTTCTCTGCTTATAATCTTGACACTTGAGAAGCTAATTGACGCTGCATGACTA 

CTCAACTCTTTAAGCAAACAATGCTGTTGTTCATGAAAAGCACAATAAAGTACATAT 

GTCCCATAATATTCATCAAAATTTGCATGCAGCACATAATAGCAATCAAAGCAATAA 

CACCCACTGTTCACAGAGACTTTAAACATGAAACTGGAACTATGTCTAGTGTTTTGA 

CTTAGGGTACATAGTATGCTGTGTCTGTATGTACCAATGTTGATTTAGGTCATCAGA 

CAGCATTTGGAACATGTATCTTCAGGAGGAATCATTCATGTATCCTGCATGAAATTC 

TCCACCTATGTTTATTCTCTTAGCCAGGTTTTTCTCTGATGGAGAAACATTGGGTTTG 

AGGTTTTACTCCCAGGTAACATTTAGGGAAAAGCTGTCTATGTTCTCAGTTTGGCTTT 

TATTTATGAGGGATGTTGGTATTCCAGAAAATTCTCTTTTGAAGAGATTACAATTTA 

GGTCAAAACAGAAAAATATGTAAAAAGTTATTGTTTTTATTAGTATTTCATGTTCT^ 

TOTTITrAAAAATGGTATGCTTAGAACTAATTAAGATTAGATTAGATTAGATTAGA 

AAATAATCAGAGAGGGATTTGATGAATGCTAAAGCATCATGAAAAATTCAAAATTT 

TTTGCTTCTAATTCAGAATCAATTAAATTCATATTACTATAAAAGACAGCACGCCAG 

ATGTGTGCCAGCTGAGGAGTGGATAAACTGTGTAACGTGAGTGCTATGTAGAAACA 

GAAAGGAGTGAAGGGTTGATGTGCGCTGCAACATCTTGAAAACATTCGGCTACATG 

ATGGAAGCCAGGCACAAAAAGCCACATATTGCATGGTTATGTTTATATGAAATGTTT 

AAAATACATGGATTCTTAGCAAACAGAGTAAGATGTTACTTAGGGTCAGGAAAAGA 

TTAAAAAAAAAAAAACTATTGATGTGGAATGATCTTAATTTGGGGAAAAGACAATT 

TCCTAAGACGAAATAGTTGAGGTAGATATAGTTATATCCCTGTGGATATTGTAATAA 

ACCAGCATGCTGTGCTCTGAGAAGGGCCTAATGAAGGGGCAGGAGGAAGTGAAATG 

AGATGGTAGAAAGGAAAGTCATATACCATGGCTTCTCTCGTGGGTGGAATCTAGAT 

ATGTTAATATATTGACATAAAGGAAGGAATTGTTTAGGGAAGGATCAAAACCAACA 

GGAGTGAGGGAGACAATAGGAACCAATGAGAGGCAAAGTTCATGGTCAATGTGTGT 

GGAGACACCATAATAAAACTCCTTTTTTGTTTGCTAACTAAAACCACTAAAA 

AAACAAAACATTTTTGCACAAGAATTATTTATTATTCAATAAAGATGTTTAAATGGG 

GGAAGTTGAAGTTCATTGATAGTCTCATAAATCTTAAATGTATIJAAACTGCTTTTTA 

CGTTTTTTATTATTAATTACTCTrGCTGTCATTATTATCATCATCATTA 



CATCACTAATGCTTTTCACCATACACAAATGTAGGCAGAAGAGTGTAATCCACTTAG 

TGAGGCAATCTTGGAGAGGGAAAGGAAGCGGATGCGGGGCAGAGGCACACAGGAG 

GACAGTGAGAGGGAAATGAACAAGAAAAAATGTGGACACATGCACAAAAATTCCA 

TAGTCCACTACATTACTTTGTATTCTAATATTAAGAAAATAATAAACCCATTTCTGTG 

CACTTATCACCCAGGCTCAACAGTTATCTTGGCCACAGATCCTGTCTCACTGCATCCT 

GTCCACCTGAGTCCACTTAGCGTTCTGAATCCAATCCAGGGCATGATGCITACTCCT 

ACACAGAACTAAAGATTAAAGAGAGTTTAAAAGTAACCATGACATCTCTCTGTTCCT 

TTAGCGATAAGTTCTTAATATTTATGGCTGCTTGTGTATGTTCTAATTTCTCTAATATT 

GTCACATTTAGTTGGCAACTACTTTGTTTGAATTGAGTTGGAGTTAAGGTCCCATAG 

GATTAATCTCAACATATTTCTATATTTATAAACTTTTCTCTCTTTGTGAAAGTTCCm 

GAGAAAACAAATATGCCCATATCnTTCTTTACAGGTCTTAAAAATGAGCTATAACAA 

AGTCCAAATAATTGAGAAGGATACTTTGTATGGACTCAGGAGCTTGACCCGGTTGCA 

CCTGGATCACAACAACATTGAGTTTATCAACCCCGAGGCGTTTTACGGACTCACCTT 

GCTCCGCTTGGTACATCTAGAAGGAAACCGGCTGACAAAGCTCCATCCAGACACAT 

TTGTCTCTTTGAGCTATCTCCAGATATTTAAAACCTCCTTCATTAAGNACCTGTACTT 

GTATGATAACTTCATTGACCTCCCTCCCAAAAGAAATGGTCTCCTCTATGCCAAACC 

TAGAAAGCCTTTACTTGCATGGAAACCCATGGACCTGTGACTGCCATTTAAAGTGGT 

TGTCCGAGTGGATGCAGGGAAACCCAGGTAACTATCTTGTTTGT^ 

ATARKACGTATTTTCCTCAATTTCATTTAGAATGATATCCCAAAAGTCCCCCATAACC 

TCCCCCCCACnTCCCTACCTACCCATTCCCATTTTTTGGCCCTGGCATTCCCCTGTACT 

GGGGCATATAAAGTTTGCGTGTCCAATGGACCTCTCTTTCCAGTGATGGCCAACTAG 

GCCATCTTTTGATACATATGCAGCTAGAGTCAAGAGCTCTGGGGTACTGGTTAGTTC 
ATAATGTTGTTGCACCTACAGGGTTGAA (SEQ ID NO:20) 
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MPKRAHWGAI^VVLILLWGHPRVALACPHPCACYVPSEVHCTFRSLASVPAGIARHVE 

PJNLGFNSIQAIJJETSFAGLTKIJIsLLMIH^ 

QTLQGLSNLMRUDDHNKIEFIIffQ 

LSTIRHLYLAENMVRTLPASMLRNMPLIJ^Y^ 

KCKKX)KAYEGGQLCAMCFSPKKLYKHE]HKLKI)MTCLKPS1ESP1JIQNRSRSIEEEQEQ 

EEDGGSQLII^KTQLPQWSISLNMTDEHGNMVNLVCDIKKPMDVYKIHLNQTDPPDro 

ATVALDFFCPMTRENYEKLWKLIAYYSEWVKLHRELMLSKDPRVSYQYRQDADEEAL 

YYTGVRAQILAEPEWVMQPSrDIQLNRRQSTAKKVLLSYYTQYSQTISTKDTRQARGRS 

WVTVDEPSGAVQRDQWI^GGPCQI^CNVKASESPSIFWVLPDGSILKAPMDDPDSKFSIL 

SSGWLRIKSMEPSDSGLYQCIAQVRDEMDRMVYRVLVQSPSTQPAEKDTVTIGKNPGES 

VTIPCNALAffEAHLSWIIj'NRRII^ 

NQQGADHFTVGITWKKGSGLPSKRGRRPGAKALSRVREDrVEDEGGSGMGDEENTSR 

RLLHPKDQEVFLKTKDDAINGDKKAKKGRRKLKLWKHSEKEPETNVAEGPJIVFESP^ 

E^MANKQINPERWADILAKWGKNL^ 

VQTWSAEESSADVPLLGEEEHVLGTISSASMGLEHNHNGVILVEPEVTSTPLEEyVDDL 
SEKTEEITSTEGDLKGTAAPTLISEPYEPSPTlini^^ 

PEFTSSEYEPPLDAVSLAESEPMQYFDPDLETKSQPDEDKMKEDTFAHLTPTPTIWVNDS 
STSQIi^DSTIGEPGVPGQSHLQGLTDNIHLV^ 

IJBGDPraSRSSESEGQESKSIIIJ^TLGIMSSMS 
QKVAPSSTMSTHPSRPJO>NGRPJUJlPNKFPJIRHKQTPPTTFAPSE 
QVESSLVPTAWVDNTVNTPKQI^MEKNAEPTSKGIPRRKHGKRPNKm 
SGSKPSPSPENKHRNTVTPS SETILIJPRTVSLKTEGP YDSIJDYMTTTRKIYS S YPKVQETLP 
WYKPTSDGKEIKDDVATNVDKHKSD^^ 

GTPTWNPSRTAQPGRLQTDIP VTTS GENLTDPPLLKELEDVDFTSEFLS SLTVSTPFHQEE 

AGSSTTI^SIKVEVASSQAETTTLDQDHIJETTVAI]JJSETRPQNHTFrAA 

TEMSLGQTTTTKPALPSPRISQASRDSKENVFLNYVGNPETEATPVNNEGTQHMSGPNE 

I^TPSSDRDAFM^TKLELEKQVFGSRSLPRGPDSQRQDGRVHASHQLTRVPAKPILPTA 

TVRLPEMSTQSASRYFVTSQSPRHWTNKPElTrYPSGALPENKQFTTPRI^ 

SKPSIPSKFTDRRTDQFNGYSKWGNNNIPEARNPV^ 

PQLGVTRRPQIPTSPAPVMRERKVIPGSYNRIHSHS 

I^NIPMVSSTQSSISFTTSSVQSSGSFHQSSSKFFAGGPPASKFWSLGEKPQILTKSPQTVSV 
TAETOTVFPCEATGKPKPFVWTKVSTGALMTPNTRIQRFEVLKNGTLVIRKVQVQDRG 
QYMCTASNLHGIJDRMVVLLSVTVQQPQILASHYQDVTVYLGDTIAMECLAKGTPAPQI 
SWIFPDRRVWQTVSPVESRITIJIENR^ 

AAIPPVfflQEKLEMSLPPGIJSIHmCTAKAAPLPSWWVLGDGTQIRPSQFlBGNIJ 
NGTLYIRNLAPKDSGRYECVAANLVGSARRTVQLNVQRAAANARITGTSPRRTDVRYG 
GTLKLDCS AS GDP WPRILWRLPSKRMID ALFSFDSRIK VFAN GTL V VKS VTDK 



DAGDYLCVARNKVGDDWVLKVDVVMKPAKffiHKEENDHKVFYGGDLKVDCVATGL 

PNPEISWSIPDGSLVNSFMQSDDSGGRTKRYVWNNGTLYFNEVGMREEGDYTCFAEN 

QVGKDEMRVRVKVWAPATmNKTYLAVQWYGDVVTVACEAKGEPMPKVTWLSPTN 

KVIPTSSEKYQIYQDGTIilQKAQRSDSGNYTCLVRNSAGEDRKTVWIHVNVQPPK^ 

NPNPITTVREIAAGGSRKLIDCKAEGIPTPRVLWAFPEGVVLPAPYYGNRITW 

RSLRKSDSVQLVCMARNEGGEAPJLIVQLTV^ 

AAGTnTSLVWVIJNGTDLQSGQQLQRFYHKADGMLfflSGLSSVDAGAYRCVARNAA 

GHTERLVSLKVGLKPEANKQYHNLVSIINGETLKIPCTPPGAGQGRFSWTIPNGMHI^G 

PQTLGRVSLIJDNGTLTVREASVFDRGTYVCRM^ 

YTPJ>GNTVKLNCMAMGIPKADITWELPDKSHLKAGVQARLYGNRFLHPQGSLTIQHA 
QRDAGFYKCMAKNILGSDSKTTYIHVF (SEQ ID NO: 21) 
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ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCCGTGGTGC TGATCCTGCT 
TTGGGGCCAT CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT 
ACGTCCCCAG CGAGGTCCAC TGCACGTTCC GATCCCTGGC TTCCGTGCCC 
GCTGGCATTG CTAGACACGT GGAAAGAATC AATTTGGGGT TTAATAGCAT 
ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG TTGGAGCTAC 
TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 
GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT 
GATCACAGGA CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA 
TTGACCACAA CAAGATCGAG TTTATCCACC CTCAAGCTTT CAACGGCTTA 
ACGTCTCTGA GGCTACTCCA TTTGGAAGGA AATCTCCTCC ACCAGCTGCA 
CCCCAGCACC TTCTCCACGT TCACATTTTT GGATTATTTC AGACTCTCCA 
CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 
AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA 
TCCGTGGACC TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA 
AATCCAGAGG AATTCTGAAG TGTAAA AAGG ACAAAGCTTA TGAAGGCGGT 
CAGTTGTGTG CAATGTGCTT CAGTCCAAAG AAGTTGTACA AACATGAGAT 
ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA GAGTCCCCTC 
TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 
GATGGTGGC A GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG 
CATCTCTTTG AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT 
GTGACATCAA GAAACCAATG GATGTGTACA AGATTCACTT GAACCAAACG 
GATCCTCCAG ATATTGACAT AAATGCAACA GTTGCCTTGG ACTTTGAGTG 
TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG ATAGCATACT 
ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 
CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA 
CTACACAGGT GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC 
AGCCATCCAT AGATATCCAG CTGAACCGAC GTCAGAGTAC GGCCAAGAAG 
GTGCTACTTT CCTACTACAC CCAGTATTCT CAAACAATAT CCACCAAAGA 
TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGATTGAG CCTAGTGGAG 
CTGTGCAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 
TGCAACGTGA AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAGA 
TGGCTCCATC CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA 
TTCTCAGCAG TGGCTGGCTG AGGATCAAGT CCATGGAGCC ATCTGACTCA 
GGCTTGTACC AGTGCATTGC TCAAGTGAGG GATGAAATGG ACCGCATGGT 
ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC GAGAAAGACA 
CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT GCCTTGCAAT 
GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG 
GATAATTAAT GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG 
GAACTCTTTC CATCCCAAAG GTCCAAGTCA GTGATAGTGG 



TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC CATTTTACGG 
TGGGAATCAC AGTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 
AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA 
GGATGAAGGG GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC 
TTCTGCATCC AAAGGACCAA GAGGTGTTCC TCAAAACAAA GGATGATGCC 
ATCAATGGAG ACAAGAAAGC CAAGAAAGGG AGAAGAAAGC TGAAACTCTG 
GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA GGTCGCAGAG 
TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 
GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA 
GGGCACAGAA GTACCCCCGT TGATTAAAAC CACAAGTCCT CC ATCCTTGA 
GCCTAGAAGT CACACCACCT TTTCCTGCTG TTTCTCCCCC CTCAGCATCT 
CCTGTGCAGA CAGTAACCAG TGCTGAAGAA TCCTCAGCAG ATGTACCTCT 
ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA GCCAGCATGG 
GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 
AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA 
GATAACTTCC ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA 
TATCTGAGCC TTATGAACCA TCTCCTACTC TGCACACATT AGACACAGTC 
TATGAAAAGC CCACCCATGA AGAGACGGCA ACAGAGGGTT GGTCTGCAGC 
AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT GAGCCTCCAT 
TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 

GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC 

CTTTGCACAC CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTCCAGTA 
CATCACAGTT ATTTGAGGAT TCTACTATAG GGGAACCAGG TGTCCCAGGC 
C AATCAC ATC TACAAGGACT GACAGACAAC ATCCACCTTG TGAAAAGTAG 
TCTAAGC ACT CAAGACACCT TACTGATTAA AAAGGGTATG AAAGAGATGT 
CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 
AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA 
CTCCACACTG GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG 
AAACCACAGT TGGTACCCTC CTAGACAAAG ACACCACAAC AGTAACAACA 
ACACCAAGGC AAAAAGTTGC TCCGTCATCC ACCATGAGCA CTCACCCTTC 
TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC AAATTCCGCC 
ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 
TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA 
GAGTTCTCTG GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA 
AACAGTTGGA AATGGAGAAG AATGCAGAAC CCACATCCAA GGGAACACCA 
CGGAGAAAAC ACGGGAAGAG GCCAAACAAA CATCGATATA CCCCTTGTAC 
AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT CCAGAAAATA 
AACATAGAAA CATTGTTACT CCCAGTTCAG AA ACTATACT TTTGCCTAGA 
ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC 
AACCACCAGA AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC 
CAGTCACATA 



TAAACCCACA TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG 
TTGACAAACA TAAAAGTGAC ATTTTAGTCA CTGGTGAATC AATTACTAAT 
GCCATACCAA CTTCTCGCTC CTTGGTCTCC ACTATGGGAG AATTTAAGGA 
AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCC AACCTGG AATCCCTCAA 
GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 
GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA 
- TTTCACTTCC GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC 
AGGAAGAAGC TGGTTCTTCC ACAACTCTCT CAAGCATAAA AGTGGAGGTG 
GCTTCAAGTC AGGCAGAAAC CACCACCCTT GATCAAGATC ATCTTGAAAC 
CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT CACACCCCTA 
CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 
ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG 
AATATCTCAA GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG 
TGGGGAATCC AGAAACAGAA GCAACCCCAG TCAACAATGA AGGAACACAG 
CATATGTCAG GGCCAAATGA ATTATCAACA CCCTCTTCCG ACCGGGATGC 
ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA TTTGGTAGTA 
GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 
GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC 
AACAGTGAGG CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG 
TAACTTCCCA GTCACCTCGT CACTGGACCA ACAAACCGGA AATAACTACA 

TATCCTTCTG GGGCTTTGCC AGAGAACAAA CAGTTTACAA CTCCAAGATT 
ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA CCCAGCATTC 
CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 
GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGA AACCCAG TTGGAAAGCC 
TCCCAGTCC A AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA 
CCAACAAGAC TCTTTCTTTT CCACAGTTGG GAGTCACCCG GAGACCCCAG 
ATACCCACTT CTCCTGCCCC AGTAATGAGA GAGAGAAAAG TTATTCCAGG 
TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTG GACTTTGGCC 
CTCCGGCACC TCCGTTGTTG CACACTCCGC AGACCACGGG ATCACCCTCA 
ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC 
CTTTATAAC A TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT 
CAAAGTTCTT TGCAGGAGGA CCTCCTGCAT CCAAATTCTG GTCTCTTGGG 
GAAAAGCCCC AAATCCTCAC C AAGTCCCCA CAGACTGTGT CCGTCACCGC 
TGAGACAGAC ACTGTGTTCC CCTGTGAGGC AACAGGAAAA CCAAAGCCTT 
TCGTTACTTG GAC AAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 
AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGGAA 
GGTTCAAGTA CAAGATCGAG GCCAGTATAT GTGCACCGCC AGCAACCTGC 
ACGGCCTGGA CAGGATGGTG GTCTTGCTTT CGGTCACCGT GCAGCAACCT 
CAAATCCTAG CCTCCCACTA CCAGGACGTC ACTGTCTACC TGGGAGACAC 
CATTGCAATG GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC CAAATTTCCT 
GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC 



CGTGGAGAGC CGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG 
AGGCGTCCTT CTCAGACAGA GGCGTCTATA AGTGCGTGGC CAGCAATGCA 
GCCGGGGCGG ACAGCCTGGC CATCCGCCTG CACGTGGCGG CACTGCCCCC 
CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC CCGGGGCTCA 
GCATTCACAT TCACTGCACT GCCAAGGCTG CGCCCCTGCC CAGCGTGCGC 
TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT TCCTCCACGG 
GAACTTGTTT GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC 
CCAAGGACAG CGGGCGCTAT GAGTGCGTGG CCGCCAACCT GGTAGGCTCC 
GCGCGCAGGA CGGTGCAGCT GAACGTGCAG CGTGCAGCAG CCAACGCGCG 
CATCACGGGC ACCTCCCCGC GGAGGACGGA CGTCAGGTAC GGAGGAACCC 
TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 
AGGCTGCCGT CC AAGAGGAT GATCGACGCG CTCTTCAGTT TTGATAGCAG 
AATCAAGGTG TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGACGGACA 
AAGATGCCGG AGATTACCTG TGCGTAGCTC GAAATAAGGT TGGTGATGAC 
TACGTGGTGC TCAAAGTGGA TGTGGTGATG AAACCGGCCA AGATTGAACA 
CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC CTGAAAGTGG 
ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTGCCA 
GACGGGAGTC TGGTGAACTC CTTCATGCAG TCGGATGACA GCGGTGGACG 
CACCAAGCGC TATGTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG 
TGGGGATGAG GGAGGAAGGA GACTACACCT GCTTTGCTGA AAATCAGGTC 
GGGAAGGACG AGATGAGAGT CAGAGTCAAG GTGGTGACAG CGCCCGCCAC 
CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT GGAGACGTGG 
TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 
TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT 
ATACCAAGAT GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGCG 
GCAACTACAC CTGCCTGGTC AGGAACAGCG CGGGAGAGGA TAGGAAGACG 
GTGTGGATTC ACGTCAACGT CCAGCCACCC AAGATCAACG GTAACCCCAA 
CCCCATCACC ACCGTGCGGG AGATAGCAGC CGGGGGCAGT CGGAAACTGA 
TTGACTGCAA AGCTGAAGGC ATCCCCACCC CGAGGGTGTT ATGGGCTTTT 
CCCGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGATCACTGT 
CCATGGCAAC GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGCGACTCCG 
TCCAGCTGGT ATGCATGGCA CGCAACGAGG GAGGGGAGGC GAGGTTGATC 
GTGCAGCTCA CTGTCCTGGA GCCCATGGAG AAACCC ATCT TCCACGACCC 
GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC AGCCTCAACT 
GCTCTGCCGC GGGGACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 
GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC 
TGACGGCATG CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT 
ACCGCTGCGT GGCCCGCAAT GCCGCTGGCC ACACGGAGAG GCTGGTCTCC 
CTGAAGGTGG GACTGAAGCC AGAAGCAAAC AAGCAGTATC ATAACCTGGT 
CAGCATCATC AATGGTGAGA CCCTGAAGCT CCCCTGCACC CCTCCCGGGG 
CTGGGCAGGG ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 



GGCCCCCAAA CCCTGGGACG CGTTTCTCTT CTGGACAATG GCACCCTCAC 
GGTTCGTGAG GCCTCGGTGT TTGACAGGGG TACCTATGTA TGCAGGATGG 
AGACGGAGTA CGGCCCTTCG GTCACCAGCA TCCCCGTGAT TGTGATCGCC 
TATCCTCCCC GGATCACCAG CGAGCCCACC CCGGTCATCT ACACCCGGCC 
CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT CCCAAAGCTG 
ACATCACGTG GGAGTTACCG GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 
GCTCGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT 
CCAGCATGCC ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGC AAAAA 
ACATTCTCGG CAGTGACTCC AAAACAACTT ACATCCACGT CTTCTGAAAT 
GTGGATTCCA GAATGATTGC TTAGGAACTG ACAACAAAGC GGGGTTTGTA 
AGGGAAGCCA GGTTGGGGAA TAGGAGCTCT TAAATAATGT GTCACAGTGC 
ATGGTGGCCT CTGGTGGGTT TCAAGTTGAG GTTGATCTTG ATCTACAATT 
GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT CAGCCTTGCT 
GAGACACTTT CTTTTGTGTT TACATCATGC CAGGGGCTTC ATTCAGGGTG 
TCTGTGCTCT GACTGCAATT TTTCTTCTTT TGCAA ATGCC ACTCGACTGC 
CTTCATAAGC GTCCATAGGA TATCTGAGGA ACATTCATCA AAAATAAGCC 
ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA TCACCTAGTT 
AACCTGCTGC AGTTTTTACA TGATAGACTT TGTTCCAGAT TGACA AGTCA 
TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT GCCCAATAAG 
GATTTAGAAC CAGAGTGACT GATATATATA TATATATTTT AATTCAGAGT 
TACATACATA CAGCTACCAT TTTATATGAA AAAAGAAAA A CATTTCTTCC 
TGGAACTCAC TTTTTATATA ATGTTTTATA TATATATTTT TTCCTTTCAA 
ATCAGACGAT GAGACTAGAA GGAGAAATAC TTTCTGTCTT ATTAAAATTA 
ATAAATTATT GGTCTTTACA AGACTTGGAT AC ATTACAGC AGACATGGAA 
ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC 
TGTTATATTA CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT 
AGATTTCCTT GTATGCAAAG TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT 
GAGAGGAGAG GAAGGAGAAA ACTGCATCAT AACTTTACAG AATTGAATCT 
AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT CTGGCTTGTC 
CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 
CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT 
TAAGGTCAAT ATACTGTACA TTTGATAATA AAATAATATT CTCCC AAAAA AAAAA 
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ATGAAGGTAAAAGGCAGAGGAATCACCTGCTTGCTGGTCTCCTTTGCTGTGATCTGCCTGGTCGCCACC 
CCTGGGGGCAAGGCCTGTCCTCGCCGCTGTGCCTGTTATATGCCTACGGAGGTACACTGCACATTTCGG 
TACCTGACTTCCATCCCAGACAGCATCCCGCCCAATGTGGAACGCATCAATTTAGGATACAACAGCTTG 
GTTAGATTGATGGAAACAGATTTTTCTGGCCTGACC^^ 

ATTC^CACAATCCCTGACAAGACCTTCTCAGATTTGCAGGCCTTGCAGGTCTTAAAAATGAGCTATAAT 
AAAGTCCGAAAACTTCAGAAAGATACTTTTTATGGCCTCAGGAGCTTGACACGATTGCACATGGACCAC 
AACAATATTGAGTTTATAAACCCAGAGGTTTTTTATGGGCTCAACTTTCTCCGCCTGGTGCACTTGGAA 
GGAAATCAGCTC^CTAAGCTCCACCC^GATACATTTGTCTCTTTGAGCTACCTCCAGATATTTAAAATC 
TCTTTCATTAAGTTCCTATACTTGTCTGATAACTTCCTGACCTCCCTCCCTCAAGAGATGGTCTCCTAT 
ATGCCTGACCTAGACAGCCTTTACCTGC^TGGAAACCC^TGGACCTGTGATTGCCATTTAAAGTGGTTG 
TCTGACTGGATACAG GAGAAG CCAGATGTAATAAAATGCAAAAAAGATAGAAGTCCCTCTAGTGCTCAG 
CAGTGTCCACTTTGCATGAACCCTAGGACTTCTAAAGGCAAGCCGTTAGCTATGGTCTCAGCTGCAGCT 
TTCCAGTGTGCCAAGCCAACCATTGACTCATCCCTGAAATCAAAGAGCCTGACTATTCTGGAAGACAGT 
AGTTCTGCTTTCATCTCTCCCCAAGGTTTCATGGCACCCTTTGGCTCCCTCACTTTGAATATGACAGAT 
CAGTCTGGAAATGAAGCTAACATGGTCTGCAGTATTCAAAAGCCCTCAAGGACATCACCCATTGCATTC 
ACTGAAGAAAATGACTACATCGTGCTAAATACTTCATTTTCAACATTTTTGGTGTGCAACATAGATTAC 
GGTCACATTCAGCCAGTGTGGCAAATTTTGGCTTTGTACAGTGATTCTCCTCTGATACTAGAAAGGAGC 
CACTTGCTTAGTGAAACACCGCAGCTCTATTACAAATATAAACAGGTGGCTCCTAAGCCTGAAGACATT 
TTTACCAACATAGAGGCAGATCTCAGAGCAGATCCCTCTTGGTTAATGCAAGACCAAATTTCCTTGCAG 
CTGAACAGAACTGCCACCACATTCAGTACATTACAGATCCAGTACTCCAGTGATGCTCAAATCACTTTA 
CCAAGAGCAGAGATGAGGCCAGTGAAACACAAATGGACTATGATTTCAAGGGATAACAATACTAAGCTG 
GAACATACTGTCTTGGTAGGTGGAACCGTTGGCCTGAACTGCCCAGGCCAAGGAGACCCCACCCCACAC 
GTGGATTGGCTTCTAGCTGATGGAAGTAAAGTGAGAGCCCCTTATGTCAGTGAGGATGGACGGATCCTA 
ATAGACAAAkGTGGAAAATTGGAACTCCAGATGGCT 

AGCAGCAATTATGATGATGCAGATATTCTCACCTATAGGATAACTGTGGTAGAACCTTTGGTCGAAGCC 

TATCAGGAAAATGGGATTCATCACACAGTTTTCATTGGTGAAACACTTGATCTTCCATGCCATTCTACT 

GGTATCCCAGATGCCTCTATTAGCTGGGTTATTCCAGGAAACAATGTGCTCTATCAGTCATCAAGAG 

AAGAAAGTTCTAAACAATGGCACATTAAGAATATTACAGGTCACCCCGAAAGACCAAGGTTATTATCGC 

TGTGTGGCAGCCAACCCATCAGGGGTTGATTTTTTGATTTTCCAAGTTTCAGTCAAGATGAAAGGACAA 

AGGCCCTTGGAGCATGATGGAGAAACAGAGGGATCTGGACTTGATGAGTCCAATCCTATTGCTCATCTT 

AAGGAGCCACCAGGTGCACAACTCCGTACATCTGCTCTGATGGAGGCTGAGGTTGGAAAACACACCTCA 

AGCACAAGTAAGAGGCACAACTATCGGGAATTAACACTC^ 

TTTAGGGAGAATAGGAGGCATTTCCCTCCCTCTGCTAGGAGAATTGACCCACAACATTGGGCGGCACTG 
TTGGAGAAAGCTAAAAAGAATGCTATGCCAGACAAGCGAGAAAATACCACAGTGAGCCCACCCCCAGTG 
GTCACCCAACTCCCAAACATACCTGGTGAAGAAGACGATTCCTCAGGCATGCTCGCTCTACATGAGGAA 
TTTATGGTCCCGGCCACTAAAGCTTTGAACCTTCCAGCAAGGACAGTGACTGCTGACTCCAGAACAATA 
TCTGATAGTCCTATGACAAACATAAATTATGGCACAGAATTCTCTCCTGTTGTGAATTCACAAATACTA 
CCACCTGAAGAACCCACAGATTTCAAACTGTCTACTGCTATTAAAACTACAGCCATGTCAAAGAATATA 
AACCCAACCATGTCAAGCCAAATACAAGGCACAACCAATCAACATTCATCCACTGTCTTTCCACTGCTA 
CTTGGAGCAACTGAATTTCAGGACTCTGACCAGATGGGAAGAGGAAGAGAGCATTTCCAAAGTAGACCC 
CCAATAACAGTAAGGACTATGATCAAAGATGTCAATGTCAAAATGCTTAGTAGCACCACCAACAAACTA 
TTATTAGAGTCAGTAAATACCACAAATAGTCATCAGACATCTGTAAGAGAAGTGAGTGAACCCAGGCAC 
AATCACTTCTATTCTCACACTACTCAAATACT 

GCTCATTCTCAGTTTCCGATCCCTAGAAATAGTACAGTTAAC^TCCCGCTGTTCAGACGCTTTGGGAGG 
CAGAGGAAAATTGGCGGAAGGGGGCGGATTATCAGCCCATATAGAACTCCAGTTCTGCGACGGCATAGA 
TACAGCATTTTCAGGTCAACAACCAGAGGTTCTTCT 
CTCAATGTGAGATGTCTGTCCTGTCTTCCC^ 

CCAAGTGCTGCTCCC^TCACCTTCCCCAAAGCTGACATTGCTAGAGTCCCATCAGAAGAGTCTACAACT 
CTAGTCCAGAATCCACTATTACTACTTGAGA 

TATTTCAGGACTGAAATTTCCCAAGTGACTCCAACTGGTGCAGTCATGACATATGCTCCAACATCCATA 
CCCATGGAAAAAACTCACAAAGTAAACGCCAGTTACCCACGTGTGTCTAGCACCAATGAAGCTAAAAGA 
GATTCAGTGATTACATCGTC^CTTTCA^ 

AGGTTTTCAAGAAGGAAAATTCCCTGGC^OVGAACTTTGTAAATAACCATAACCeAAAAGGCAGATTA 
AGGAATCAACATAAAGTTAGTTTACAAAAAAGCACAGCTGTGATGCTTCCTAAAACATCTCCTGCTTTA 
CC^CAGAGACAAAGTTCCCCTTTCCATT^ 

ACCTTGACTACCGCTCACCACACTACGACCAAAACACACAATCCTGGAAGTCTTCCAACAAAGAAGGAG 
CTTCCCTTCCCACCCCTTAACCCTATGCTTCCTAGTATTATAAGCAAAGACTCAAGTACAAAAAGCATC 
ATATCAACGGAAACAGCAATACCAGCAACAACTCCTACCTTCCCTGCATCTGTCATCACTTATGAAACC 
CAAACAGAGAGATCTAGAGCACAAACAATACAAAGAGAACAG 

CCAAACATCTCTCCAGACCAGAGTTCTGGCTTCACTACACCCACTGCTATGACACCTCCTGCTCTGGCA 



TTCACTCATTCCCCACCAGAAAACACAACTGGGATTTCAA 
AATCTGACAGATGTGATTGAAGAACTAGCCCAAGCAAGTACTCA^^ 

GAAACAACTTTGTCCAGCAAATCACACCAGAGTACCAC^CTAGGAAAGCATCATTAGACACTCCCATA 

CCACCATTCTTGAGC^GCAGTGCTACTCTAATGCCAGTTCCCATCTCCCCTCCCTTTACTCAGAGAGCA 

GTTACTGACACACGTGGCGACTCCC^TTTCCGGCTTATGACAAATACAGTGGTCAAGCTGCACGAATCC 

TCAAGGCACAATCTCCAAATGCCAAGTTCACAATTGGAACCACTCACTTCATCTACCTCTAATCTGTTA 

CATTCTACTCCCATGCCAGCACTAACAACAGTTAAATCACAGAATTCCAAATTAACTCCATCTCCCTGG 

GCAGAATACCAATTTTGGCACAAACCATACTCAGACATTGCTGAAAAAGGCAAAAAGCCAGAAGTAAGC 

ATGTTGGCTACTACAGGCCTGTCCGAGGCCACCACTCTTGTTTCAGATTGGGATGGACAGAAGAACACA 

AAGAAGAGTGACTTTGATAAGAAACCAGTTCAAGAAGCAACAACTTCCAAACTCCTTCCCTTTGACTCT 

TTGTCTAGGTATATATTTGAAAAGCCCAGGATAGTTGGAGGAAAAGCTGCAAGTTTTACTATTCCAGCT 

AACTCAGATGCCTTTCTTCCCTGTGAAGCTGTTGGAAATCCCCTGCGCACCATTCATTGGACCAGAGTT 

TCAGGACTTGATTTATCTAGAGGAAACCAGAATAGCAGGGTCCAGGTTCTCCCCAATGGTACCCTGTCC 

ATCCAGAGGGTGGAAATTCAGGACCGCGGACAGTACTTGTGTTCCGCATCCAATCTGTTTGGCACAGAC 

CACCTTCATGTCACCTTGTCTGTGGTTTCCTATCCTCCCAGGATCCTGGAGAGACGTACCAAAGAGATC 

ACAGTTCATTCCGGAAGCACTGTGGAACTGAAGTGCAGAGCAGAAGGTAGGCCAAGCCCTACAGTTACC 

TGGATTCTTGCAAACCAAACAGTTGTCTCAGAATCATCCCAGGGAAGTAGGCAGGCTGTGGTGACGGTT 

GACGGAACATTGGTCOTCCAC^TCTC^GTATTTATGACCGTGGCTTTTACAAATGTGTGGCCAGCA^ 

CCAGGTGGCCAGGATTC^CTGCTGGTTAAAATACAAGTCATTGCAGCACCACCTGTTATTCTAGAGCAA 

AGGAGGCAAGTCATTGTAGGCACTTGGGGTGAAAGTTTAAAACTGCCCTGTACTGCAAAAGGAACTCCT 

CAGCCCAGCGTTTACTGGGTCCTCTCTGATGGCACTGAAGTGAAACCATTACAGTTTACCAATTCCAAG 

TTGTTCTTATTTTGAAATGGGACTTTGTATATAAGAAACCTAGCCTCTTCAGACAGGGGCACTTATGAA 

TGCATTGCTACCAGTTCCACTGGTTCGGAGCGAAGAGTAGTAATGCTTACAATGGAAGAGCGAGTGACC 

AGCCCCAGGATAGAAGCTGCATCCCAGAAAAGGACTGAAGTGAATTTTGGGGACAAATTACTACTGAAC 

TGCTCAGCCACTGGGGAGCCCAAACCCCAAATAATGTGGAGGTTACCATCCAAGGCTGTGGTCGACCAG 

TGGAGCTGGATCCACGTCTACCCTAATGGATCCCTGTTTATTGGATCAGTAACAGAAAAAGACAGTGGT 

GTCTACrTGTGTGTGGCAAGAAAC^AAATGGGGGATGATCTGATACTGATGCATGTTAGCCTAAGACTG 

AAACCTGCCAAAATTGACCACAAGCAGTATTTTAGAAAGCAAGTGCTCCATGGGAAAGATTTCCAAGTA 

GATTGCAAAGCTTCCGGCTCCCCAGTGCCAGAGATATCTTGGAGTTTGCCTGATGGAACCATGATCAAC 

AATGCAATG<^GCCGATGACAGTGGCCACAGGACTAGGAGATATACCCTTTTCAAC^TGGAACTTTA 

TACTTCAACAAAGTTGGGGTAGCGGAGGAAGGAGATTATACTTGCTATGCCCAGAACACCCTAGGGAAA 

GATGAAATGAAGGTCCACTTAACAGTTATAACAGCTGCTCCCCGGATAAGGCAGAGTAACAAAACCAAC 

AAGAGAATCAAAGCTGGAGACACAGCTGTCCTTGACTGTGAGGTCACTGGGGATCCCAAACCaAAAATA 

TTTTGGTTGCTGCCTTCCAATGACATGATTTCCTTCTCCATTGATAGGTACACATTTCATGCCAATGGG 

TCTTTGACCAT(^C^AAGTGAAACTGCTCGATTCTGGAGAGTACGTATGTGTAGCCCGAAATCCCAGT 

GGGGATGACACCAAAATGTACAAACTGGATGTGGTCTCTAAACCTCCATTAATCAATGGTCTGTATACA 

AACAGAACTGTTATTAAAGCCACAGCTGTGAGACATTCOWUIAACACTTTGACTGCAGAGCTGAAGGG 

ACACCATCTCCTGAAGTCATGTGGATCATGCCAGACAATATTTTCCTCACAGCCCCATACTATGGAAGC 

AGAATCACAGTCCATAAAAATGGAACCTTGGAAATTAGGAATGTGAGGCTTTCAGATTCAGCCGACTTT 

ATCTGTGTGGCCCGAAATGAAGGTGGAGAGAGCGTGTTGGTAGTACAGTTAGAAGTACTGGAAATGCTG 

AGAAGACCGACATTTAGAAATCCATTTAATGAAAAAATAGTTGCCCAGCTGGGAAAGTCCACAGCATTG 

AATTGCTCTGTTGATGGTAACCCACCACCTGAAATAATCTGGATTTTACCAAATGGCACACGATTTTCC 

AATGGACCACAAAGTTATCAGTATCTGATAGC^GCAATGGTTCTTTTATCATTTCTAAAACAACTCGG 

GAGGATGCAGGAAAATATCGCTGTGCAGCTAGGAATAAAGTTGGCTATATTGAGAAATTAGTC^TATTA 

GAAATTGGCCAGAAGCCAGTTATTCTTACCTATGCACCAGGGACAGTAAAAGGCATCAGTGGAGAATCT 

CTATCACTGCATTGTGTGTCTGATGGAATCCCTAAGCCAAATATCAAATGGACTATGCCAAGTGGTTAT 

GTAGTAGACAGGCCTCAAATTAATGGGAAATACATATTGCATGACAATGGCACCTTAGTCATTAAAGAA 

GCAACAGCTTATGACAGAGGAAACTATATCTGTAAGGCTCAAAATAGTGTTGGTCATACACTGATTACT 

GTTCCAGTAATGATTGTAGCCTACCCTCCCCGAATTACAAATCGTCCACCCAGGAGTATTGTCACCAGG 

ACAGGGGCAGCCTTTCAGCTCCACTGTGTGGCCTTGGGAGTTCCCAAGCCAGAAATCACATGGGAGATG 

CCTGACCACTCCCTTCTCTGAACGGG^GTAAAGAGAGGACACATCGAAGTGAGCAGCTTCACTTACAA 

GGTACCCTAGTCATTCAGAATCCCC^CCTCCGATTCTGGGATATAO^TGCACAGCAAAGAACCCA 

CTTGGTAGTGATTATGCAGCAACGTATATTCAAGTAATCTGA 
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ORIGIN 






1 


MKVKGRGITC 


LLVSFAVICL 


51 


IPDSIPPNVE 


RINLGYNSLV 


101 


FSDLQALQVIi 


KMSYNKVRKL 


151 


GLNFLRLVHL 


EGNQLTKLHP 


201 


PQEMVSYMPD 


LDSLYIiHGUP 


251 


SAQQCPLCMN 


PRTSKGKPLA 


301 


SAFISPQGFM 


APFGSLTLNM 


351 


YIVLNTSFST 


FLVCNIDYGH 


401 


YYKYKQVAPK 


PEDIFTNIEA 


451 


QYSSDAQITL 


PRAEMRPVKH 


501 


GDPTPHVDWL 


LADGSKVRAP 


551 


CISSNYDDAD 


ILTYRITWE 


601 


PDASISWIP 


GMNVLYQSSR 


651 


SGVDFLIFQV 


SVKMKGQRPL 


701 


SALMEAEVGK 


HTSSTSKRHN 


751 


IDPQHWAALL 


EKAKKNAMPD 


801 


ALHEEFMVPA 


TKALNLPART 


851 


LPPEEPTDFK 


LSTAIKTTAM 


901 


TEFQDSDQMG 


RGREHFQSRP 


951 


TNSHQTSVRE 


VSEPRHNHFY 


1001 


TVNIPLFRRF 


GRQRKIGGRG 


1051 


TTAFSATVXiN 


VTCLSCLPRE 


1101 


ESTTLVQNPL 


LLLENKPSVE 


1151 


PMEKTHKVNA 


SYPRVSSTNE 


1201 


RKIPWQQNFV 


NNHNPKGRLR 


1251 


HFTTLSTSVM 


QIPSNTLTTA 


1301 


IISKDSSTKS 


IISTQTAIPA 


1351 


QKKNRTDPNI 


SPDQSSGFTT 


1401 


RTLNLTDVIE 


ELAQASTQTL 


1451 


PFLSSSATLM 


PVPISPPFTQ 


1501 


QMPSSQLEPL 


TSSTSNLLHS 


1551 


YSDIAEKGKK 


PEVSMIiATTG 


1601 


TTSKLLPFDS 


LSRYIFEKPR 


1651 


IHWTRVSGLD 


LSRGNQNSRV 


1701 


TDHLHVTLSV 


VSYPPRILER 


J. / Z> J. 


LANQTWSES 


SQGSRQAWT 


Xoui 


LLVKIQVIAA PPVILEQRRQ 




GTEVKPLQFT HSKLFLFSXG 




MLTMEERVTS 


PRIEAASQKR 


1951 


AWDQWSWIH VYPNGSLFIG 


2001 


LKPAKIDHKQ 


YFRKQVLHGK 


2051 


QADDSGHRTR RYTLFNNGTL 


2101 


TVITAAPRIR 


QSNKTNKRIK 


2151 


FSIDRYTFHA 


NGSLTINKVK 


2201 


PLINGLYTNR 


TVIKATAVRH 


2251 


YYGSRITVHK NGTLEIRNVR 


2301 


RRPTFRNPFN 


EKIVAQLGKS 


2351 


SYQYLIASNG 


SFIISKTTRE 


2401 


L.TYAPGTVKG 


ISGESLSLHC 


2451 


LHDNGTLVIK 


EATAYDRGNY 


2501 


PRSIVTRTGA AFQLHCVALG 


2551 


HLQGTLVIQN 


PQTSDSGIYK 



VATPGGKACP RRCACYMPTE VHCTFRYLTS 
RLMETDFSGL TKLELLMLHS NGIHTIPDKT 
QKDTFYGLRS LTRLHMDHNN IEFINPEVFY 
DTFVSLSYLQ IFKISFIKFL YLSDNFLTSL 
WTCDCHLKWL SDWIQEKPDV IKCKKDRSPS 
MVSAAAFQCA KPTIDSSLKS KSLTILEDSS 
TDQSGNEANM VCSIQKPSRT SPIAFTEEND 
IQPVWQILAL YSDSPLILER SHLLSETPQL 
DLRADPSWLM QDQISLQIjNR TATTFSTLQI 
KWTMISRDNN TKLEHTVLVG GTVGLNCPGQ 
YVSEDGRIIiI DKSGKLEIiQM ADSFDTGVYH 
PLVEAYQENG IHHTVFIGET LDLPCHSTGI 
DKKVLNNGTL RILQVTPKDQ GYYRCVAANP 
EHDGETEGSG LDESNPIAHL KEPpGAQLRT 
YRELTLQRRG DSTHRRFREN RRHFPPSARR 
KRENTTVSPP PWTQLPNIP GEEDDSSGML 
VTADSRTISD SPMTNINYGT EFSPWNSQI 
SKNINPTMSS QIQGTTNQHS STVFPLLLGA 
PITVRTMIKD VNVKMLSSTT NKLLLESVNT 
SHTTQILSTS TFPSDPHTAA HSQFPIPRNS 
RIISPYRTPV LRRHRYSIFR STTRGSSEKS 
RLTTATAALS FPSAAPITFP KADIARVPSE 
KTTPTTKYFR TEISQVTPTG AVMTYAPTSI 
AKRDSVITSS LSGAITKPPM TIIAITRFSR 
NQHKVSLQKS TAVMLPKTSP ALPQRQSSPF 
HHTTTKTHNP GSLPTKKELP FPPLNPMLPS 
TTPTFPASVI TYETQTERSR AQTIQREQEP 
PTAMTPPALA FTHSPPENTT GISSTISFHS 
KSTIASETTL SSKSHQSTTT RKASLDTPIP 
RAVTDTRGDS HFRLMTNTW KLHESSRHNL 
TPMPALTTVK SQNSKLTPSP WAEYQFWHKP 
LSEATTLVSD WDGQKNTKKS DFDKKPVQEA 
IVGGKAASFT IPANSDAFLP CEAVGNPLPT 
QVLPNGTLSI QRVEIQDRGQ YLCSASNLFG 
RTKEITVHSG STVELKCRAE GRPSPTVTWI 
wsroraas^ ^\m^TK. -VC%£&RQ££S& 
VIVGTWGESL KLPCTAKGTP QPSVYWVLSD 
TLYIRNIASS DRGTYEC I AT SSTGSERRW 
TEVNFGDKLL LNCSATGEPK PQIMWRLPSK 
SVTEKDSGVY LCVARNKMGD DLILMHVSLR 
PFQVDCKASG SPVPEISWSL PDGTMINNAM 
YFNKVGVAEE GDYTCYAQNT LGKDEMKVHL 
AGDTAVLDCE VTGDPKPKIF WLLPSNBMIS 
LIiDSGEYVCV ARNPSGDDTK MYKLDWSKP 
SKKHFDCRAE GTPSPEVMWI MPDNIFLTAP 
LSDSADFICV ARNEGGESVL WQLEVLEML 
TAIiNCSVDGN PPPEIIWILP UGTRFSNGPQ 
DAGKYRCAAR NKVGYIEKLV ILEIGQKPVI 
VSDGIPKPNI KWTMPSGYW DRPQINGKYI 
ICKAQNSVGH TL1TVPVMIV AYPPRITNRP 
VPKPEITWEM PDHSLLSTAS KERTHGSEQL 
CTAKNPLGSD YAATYIQVI* 



Figure 55 

OCP rat amino acid sequence 

>60S-663Nterm Rat Protein (663 aa) 

MQVRGREVS GLLI S LTAVCLVVTPGSRACPRRCACYVPTEVHCTFRYLTS I PDG I PANVE 
RINLGYNSLTRLTENDFDGLSKLELL^HSNGIHRVSDKTFSGLQSLQVLKMSYNKVQII 
RKDTFyGIiGSLVRLHLDHmiEFINPEAFYGLTSLRLVHLEGNRLTKLHPDTFVSLSYIiQ 
IFKTSFIKYLFLSDNFLTSLPKEMVSYMPNLESLYLHGNPWTCDCHLKWLSEWMQGNPDI 
IKCKKDRSSSSPQQCPLCMNPRISKGRPFAMVPSGAFLCTKPTIDPSLKSKSLVTQEDNG 
SASTSPQDFIEPFGSLSLNMTXXSGNKADMVCSIQKPSRTSPTAFTEENDYIMIiNASFST 
NLVCSVDYNHIQPVWQLIJUjYSDSPLILERKPQLTETPSLSSRYKQVALRPEDIFTSIEA 
DVRADPFWFQQEKIVLQLNRTATTLSTLQI^^ 

PKIiERTVLVGGTIALSCPGKGDPSPHLEWLLADGSKVRAPYVSEDGRILIDKNGKLELQM 
ADSFDAGLYHCISTKDADADVLTYRITVVEPYGESTHDSGVQHTVV^ 
PDASISWILPGNTVFSQPSRDRQILNNGTLRILQVTPKDQGHYQCVAANPSGADFSSFKV 
SVQ 
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pCM-H-608-663-N-term 



Swal 
Fsel 




hl©an_5 +3_cor r ected TTSKLLPFD SLSRYI FEKP&IVGGKAASFTI PAKSKlFI.PCEAVGNPLP2X^fTKVS-G[j 
rouss 5 corrected — „ 



butaan_543_correctftd DI£BjGKQNSRvmm3^I^ 

ttou&e_5jcorr*ctsd . i 

ioouse w 5_corrected ~ - — - — : „ 

rat LSLYDRGF^XCVAS^fiGQI^IiVKIQVtl T&££VX IEQKRQAIVGVLGGSLKLK?rAKGT 

huiaan_S+3_corr©ct€d LSIYDRGFyKCVASHtt^DS^^ 

kous e_5_cor r ect ed : 

human_5+3_correote<i ^svywvlsdotsvkfLOMSK^^ 

mo\xz e_5_coxrect ed . 

rat VTLTVEEGETIPRIKTkSQKiro 

human_5+3_corrected VMLTOESRVTSPRII^^QKRIEV^ 

mouse_5_corrected — . * 

»ttf*Jjeorrected 

«sous© 5 corrected , 



2iUJaaa_5+3_COrrected GVAEEGDYTCY2£M!lrf^^ 

nouse_5_eorrected , 

moua€^5joorrftcted — «. — 

»ottse_5jcorrected _ 

5 at VTVHPNGTLQ^IBI^DS^FTCV^ 

E»us*_5jcorrected , 

huHian_5+3_corrected QWKSra^CXVK^ 

mow^^jcorreoted ( ^. 

£?JL, ^ , KCA^NKVGYI EKLILLEI GOJOTILT^E ^GMVKSVSGEPLSliHGVS DG I PKFNYKWTTP 



rat 

hujaan_S+3_correcte<l 
mouse 5 corrected 



rat IHti&B&a^GEaNQ^^ 

mouse 5 corrected ^^—.—^—.^r-—-*— — — — 



^ himan_5+3_COrreCted LYIQNPQTSDSGIYKCTAKNPLGSDYAAJyiQV 

-J rnous e_5_CQ rrected — 

; 0 (rat: SEQ ID NO: 10) 

p (human_5+3_corrected: SEQ ID NO: 1 1) 



(mouse_5_corrected: SEQ ID NO: 12) 



